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NUCLEIC ACIDS ENCODING 
PROTEASES 



RELATED APPLICATION 

This application claims priority to U.S Provisional Application 60/332,633, 
5 filed November 6, 2001, the entire teachings of which are incorporated herein by 
reference. 



BACKGROUND OF THE INVENTION 

A "protease" or "protease enzyme" is a protein which has proteolytic 
activity. These enzymes have important applications in industry, biology and 
10 medicine. 

Industrial applications of proteases include food processing {e.g., serine 
proteases are used to produce dairy products and protein-rich concentrates from fish 
and livestock), brewing, and alcohol production, and are common additives in 
laundry detergents {e.g., the serine protease subtilisin is used to remove protein- 

15 based stains). They are also used in research, e.g., to degrade undesirable proteins 
during purification processes. 

In nature, proteases are involved in a wide array of biological pathways, in 
regulatory capacities, as treatments, or in pathological capacities. Tissue 
plasminogen activator (t-PA) and thrombin, for instance, are commonly used in 

20 pharmaceutical settings, while improper regulation of elastase can result in lung 

problems such as emphysema. Proteases have also been implicated in the occurance 
of many diseases, including Alzheimer's disease, cystic fibrosis, muscular 
dystrophy, arthritis, emphysema and other respiratory ailments, gastrointestinal 
diseases, hypertension, degenerative skin disorders, tumor invasion and metastasis 

25 and also viral-associated diseases, where they are important in viral maturation. The 
aspartic acid proteases pepsin and gastricsin are secreted into the stomach for food 
digestion, and are diagnostic indicators for stomach ulcer and cancer. Collagen and 
elastase degrade, respectively, the structural proteins collagen and elastin. Many 




WO 03/040393 PCT/IB02/04615 

-2- 

elastase degrade, respectively, the structural proteins collagen and elastin. Many 
human pathogens (e.g., Haemophilus influenza, Neisseria gonorrhoeae, 
Streptococcus sanguis, Streptococcus pneumonia, Neisseria meningitides) possess 
proteases capable of cleaving the hinge region of human IgAl, which is a class of 
5 antibodies found in mucous. These proteases are therefore targets in preventing 
infections and pathogenesis by these organisms. 

Proteases are also involved in cellular responses to starvation, heat-shock, 
and other stresses in which cells might find it advantageous to break down proteins 
to salvage component amino acids. 

1 0 SUMMARY OF THE INVENTION 

The present invention relates to human protease genes, particularly nucleic 
acids comprising protease genes, and the amino acids encoded by such nucleic acids. 
These sequences are shown in Appendix I. In Appendix I, each protease entry lists 
the University of California at Santa Cruz contig designation from which the 

15 sequence was analyzed {e.g., "ctgchrllq_l"), the name {e.g., "MOOSE13873"), the 
exon locations {e.g., "7240429 . .7240464, . . ."), following by the amino acid 
sequence and the nucleic acid sequence. 

Sub-family information on the sequences is shown in Appendix n. For each 
sequence, the following information is provided: the name {e.g., "MOOSE13873"), 

20 the University of California at Santa Cruz contig designation from which the 
sequence was analyzed {e.g., "ctgchrl lq__l")> and the subfamily to which the 
sequence appears to belong. The assignments were made on the basis of the best E- 
value with which the sequence aligned. 

In one embodiment, the isolated nucleic acid molecule comprises a 

25 nucleotide sequence selected from the group consisting of SEQ ID NOs:l-47, as 

shown in Appendix I, and the complements thereof. The invention further relates to 
a nucleic acid molecule which hybridizes under high stringency conditions to a 
nucleotide sequence selected from the group consisting of SEQ ID NOs:l-47, as 
shown in Appendix I, and the complements thereof. The invention additionally 

30 relates to isolated nucleic acid molecules {e.g., cDNA molecules) encoding a 
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protease polypeptide (e.g., encoding a polypeptide selected from the group 
consisting of SEQ ID NOs:48-94, as shown in Appendix I). 

The invention further provides a method for assaying a sample for the 
presence of a nucleic acid molecule encoding all or a portion of a protease in a 
5 sample, comprising contacting said sample with a second nucleic acid molecule 
comprising a nucleotide sequence encoding a protease polypeptide (e.g., one of SEQ 
ID NOs: 1-47, as shown in Appendix I, or the complement of one of SEQ ID NOs: 1- 
47; a nucleotide sequence encoding one of SEQ ID NOs:48-94, as shown in 
Appendix I), or a fragment or derivative thereof, under conditions appropriate for 

10 selective hybridization. The invention additionally provides a method for assaying a 
sample for the level of expression of a protease polypeptide, or fragment or 
derivative thereof, comprising detecting (directly or indirectly) the level of 
expression of the protease polypeptide, fragment or derivative thereof. 

The invention also relates to a vector comprising an isolated nucleic acid 

15 molecule of the invention operatively linked to a regulatory sequence, as well as to a 
recombinant host cell comprising the vector. The invention also provides a method 
for preparing a polypeptide encoded by an isolated nucleic acid molecule described 
herein (a protease polypeptide), comprising culturing a recombinant host cell of the 
invention under conditions suitable for expression of said nucleic acid molecule. 

20 The invention further provides an isolated polypeptide encoded by isolated 

nucleic acid molecules of the invention {e.g., protease polypeptide), as well as 
fragments or derivatives thereof. In a particular embodiment, the polypeptide 
comprises an amino acid sequence selected from the group consisting of SEQ ID 
NOs:48-94, as shown in Appendix L The invention also relates to an isolated 

25 polypeptide comprising an amino acid sequence which is greater than about 90 
percent identical to an amino acid sequence selected from the group consisting of 
SEQ ID NOs:48-94, preferably about 95 percent identical. 

The invention also relates to an antibody, or an antigen-binding fragment 
thereof, which selectively binds to a polypeptide of the invention, as well as to a 

30 method for assaying the presence of a polypeptide encoded by an isolated nucleic 
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acid molecule of the invention in a sample, comprising contacting said sample with 
• an antibody which specifically binds to the encoded polypeptide. 

The invention further relates to methods of diagnosing a predisposition to a 
condition mediated by a protease. The methods of diagnosing such a predisposition 
5 in an individual include detecting the presence of a mutation in a protease, as well as 
detecting alterations in expression of a protease polypeptide, such as the presence of 
different splicing variants of protease polypeptides. The alterations in expression 
can be quantitative, qualitative, or both quantitative and qualitative. 

The invention additionally relates to an assay for identifying agents which 
1 0 alter (e.g. , enhance or inhibit) the activity or expression of one or more protease 

polypeptides. For example, a cell, cellular fraction, or solution containing a protease 
polypeptide or a fragment or derivative thereof, can be contacted with an agent to be 
tested, and the level of protease polypeptide expression or activity can be assessed. 
The activity or expression of more than one protease polypeptide can be assessed 

15 concurrently (e.g., the cell, cellular fraction, or solution can contain more than one 
type of protease polypeptide, such as different splicing variants, and the levels of the 
different polypeptides or splicing variants can be assessed). 

In another embodiment, the invention relates to assays to identify 
polypeptides which interact with one or more protease polypeptides. In a yeast two- 

20 hybrid system, for example, a first vector is used which includes a nucleic acid 

encoding a DNA binding domain and also a protease polypeptide, splicing variant, 
or fragment or derivative thereof, and a second vector is used which includes a 
nucleic acid encoding a transcription activation domain and also a nucleic acid 
encoding a polypeptide which potentially may interact with the protease polypeptide, 

25 splicing variant, or fragment or derivative thereof (e.g. , a protease polypeptide 

binding agent or receptor). Incubation of yeast containing both the first vector and 
the second vector under appropriate conditions allows identification of polypeptides 
which interact with the protease polypeptide or fragment or derivative thereof, and 
thus can be agents which alter the activity of expression of a protease polypeptide. 

30 Agents that enhance or inhibit protease polypeptide expression or activity are 

also included in the current invention, as are methods of altering (enhancing or 
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inhibiting) protease polypeptide expression or activity by contacting a cell 
containing protease and/or polypeptide, or by contacting the protease polypeptide, 
with an agent that enhances or inhibits expression or activity of protease or 
polypeptide. 

5 Additionally, the invention pertains to pharmaceutical compositions 

comprising the nucleic acids of the invention, the polypeptides of the invention, 
and/or the agents that alter activity of protease polypeptide. The invention further 
pertains to methods of treating conditions mediated by proteases, by administering 
protease therapeutic agents, such as nucleic acids of the invention, polypeptides of 
1 0 the invention, the agents that alter activity of protease polypeptide, or compositions 
comprising the nucleic acids, polypeptides, and/or the agents that alter activity of 
protease polypeptide. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to nucleic acids comprising proteases, and the 

1 5 protease amino acids encoded by those nucleic acids. 

Proteases are enzymes having proteolytic activity, and so comprise an 
extremely large class of enzymes having a wide array of functions. Outside of the 
cell, the proteolytic activity of proteases can be used in many different industrial 
processes, including food and beverage processing (e.g. , beer brewing, manufacture 

20 of dairy products, protein processing, etc.), and in detergent formulation (e.g., for 
removal of proteinaceous stains). New versions of such proteases possessing 
increased levels of activity or binding, would therefore be of increased utility in such 
industrial settings. Likewise, those proteases with pharmaceutical application, e.g., 
t-PA and thrombin, would also be more useful if versions were found possessing 

25 increased activity and/or more specific activity. 

Proteases are also involved in many cellular functions, in both humans and 
other organisms, of which some of the functions can lead to disease conditions. 
Where proteases are associated with pathogenesis, it is therefore desirable to 
interfere with the activity of such enzymes. For instance, where a disease or 

30 undesirable condition is due at least in part to the action of or mediation by one or 
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more protease enzymes (e.g., emphysema, cystic fibrosis, hypertension, etc.), a 
potential method of prophylaxis would be to prevent the protease associated with the 
disease from acting at all, or from acting at the time and/or with the ligand necessary 
to cause the disease or condition. For instance, for those pathogens which infect 
5 humans, e.g., by cleaving the hinge region of human IgAl, pathogenesis could be 
prevented by inhibiting the action of that protease in the pathogenic organism. 

With the availability of complete genomic sequences for many organisms 
today, including Homo sapiens, it has become clear that there is a need for data 
mining techniques to extract the information in them, e.g., gene prediction programs. 

10 Of these, the most successful ones are those based on the comparison of known 
protein or protein-derived information, or those that use expressed sequence tags 
(ESTs) to predict gene location and structure. 

One such algorithm is GeneWise. It bases its exon prediction on the use of 
Hidden Markov Models (HMMs) of proteins to be compared against a genomic 

15 sequence, so that the translation of the sequence will match the model in a similar 
way to other HMM profile searches (Eddy, Curr. Opin. Struct. Biol. 6(3):361-5 
(1996), and allowing the presence of long insertions as long as they include donor 
and acceptor site sequences at both ends. 

To take advantage of the algorithm, the models for different protein families 

20 must be built so that they represent the full-length sequences instead of the most 

common features in them. This is a major difference with existing HMM databases 
such as Pfam (Sonnhammer et al, Proteins 28(3):405-20 (1997), in which each 
model is built to represent a family of proteins as broad as possible with minimum 
overlap between them. 

25 In the present approach, the sequences were subdivided in several families so 

that the similarity inside of a group of them was over 50%. Given this approach, 
there are several points of overlap between different families when analyzing a 
sequence, so the discrimination must be done after the search is completed. 

Several resources that include expert-supervised classifications are used to 

30 select the best groups of sequences, e.g., the GPCRdb (Horn et al, Nucleic Acids 
Res. 26(l):275-9 (1998)), PKR (Smith et al, Trends Biochem. Sri. 22(ll):444-6 
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(1997)), NuclearRdb (Horn et aL, Nucleic Acids Res. 29:346-349 (2001)), IOCH (Le 
Novere et aL, Nucleic Acids Res. 27(l):340-2 (1999)), Enzyme (Bairoch, Nucleic 
Acids Res. 28:304-305 (2000)) and Swiss-Prot (Bairoch et al 9 Nucleic Acids Res. 
28:45-48(2000)). When none is available, or the sequences included in some 
5 groups are too distantly related, the grouping must be done manually, using the 
ClustalW (Thompson et al y Nucleic Acids Res. 22:4673-4680 (1994)) package to 
measure the distance between different sequences. 

The present model was built from multiple sequence alignments of the 
different protein families obtained with DiAlign 2 (Morgenstern, Bioinfonnatics 

10 1 5(3) :21 1 -8(1 999)). DiAlign works based on segment-to-segment comparisons 

instead of arbitrary thresholds for gap opening and extension, which makes it ideally 
suited for building models that represent an entire, full-length sequence, since the 
alignments built this way have more match states that would be assigned as insertion 
states when using other alignment algorithms. The models were built using the 

1 5 standard HMMer package. 

To search for new genes, a genome-wide scan was done on the the University 
of California at Santa Cruz sequences, using the Gene Wise algorithm. It translates 
the genomic sequence on the fly to proteins and can therefore maintain a reading 
frame through insertions and deletions. The algorithm also rewards gaps in the 

20 genomic sequence relative to the model if they are encapsulated within introns, like 
splice structure. 

For each superfamily of proteins, a classification was obtained in which the 
sequences are grouped by length and similarity. Each one of these groups was then 
used to build a HMM profile representing this group of sequences. This approach 

25 aims to have models that can represent the full length of the encoded proteins for a 
whole range of proteins, without being too specific for any one of them or being too 
general, as would be a HMM built for large groups of sequences. This classification 
was based either on existing expert-supervised classifications, or by retrieval of 
sequences and classification based on pairwise alignment distances. 

30 These models were then searched against the August 2001 fixed release of 

the Santa Cruz contigs using the Paracel GeneMatcher+ Hardware Accelerator with 
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the GeneWise algorithm. The sequences were chopped at 100 Kb with an overlap of 
1 Kb. Each one of the superfamilies required between 3 and 6 days to complete and 
generate results. The results represent the coding regions of the complete final 
protein as it is found in the organism. 
5 The cross-validation of the results was done in two steps. First, all of the hits 

with an E- value lower than 10" 8 that do not overlap with one another were selected, 
and in the event of overlapping, the one with lowest E-value was selected. After 
selecting all of those matches, the DNA sequences were compared against the 
RefSeq database (Pruitt et al 9 Trends Genet 16(l):44-47 (2000)) using BLAST 

10 (Altschul et aL. Nucleic Acids Res. 25:3389-3402 (1997)). 

Over 80% of the sequences were 90% or more -identical to an existing human 
RefSeq entry. The differences are usually due to picking the wrong model for a 
certain sequence that appears as a hit more than once in different families, being a 
different valid splice variant, which can be tested by comparing to the EST database, 

15 or by addition of a small last exon to complete the match instead of accept an stop 
codon in a previous one. In all of such cases, the results are easily and quickly 
corrected by eye. Very rarely the algorithm will actually make a wrong prediction, 
which is consistent with the expected behaviour (Guigo et al. 9 Genome Res. 
10(10):1631-42(2000)). 

20 Of the remaining sequences, over 50% have a match over 90% identical in 

the public domain protein databases, and the differences between those sequences in 
the databases and the potential translations is basically the same as the differences 
between the DNA sequences and the RefSeq entries. 

A number of the genes were found to be linked with markers known to be 

25 associated with human diseases genes. These are shown in Appendix HI. The 
diseases were linked to the HMM genes in the following manner: (1) the HMM 
gene models were compared to the consensus of the human genome sequence, 
located and the results kept in a relational database; (2) all possible markers 
(Sequence Tagged Sites (STS's)) (public or deCODE genetics) are also located in 

30 the same consensus using ePCR or BLAT and results kept in a relational database; 
and (3) LOD scores for diseases are linked to markers. A span of one LOD drop 
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around the marker was also given. A computer program takes each LOD peak and 
links it to the consensus through the markerhit in the database. The database is then 
queried for all HMM genes within the span of one LOD drop or a minimum of 15 
Mb in each direction from the marker. The output is the name of the peak marker 
5 and its distance to the HMM gene. 

The full sequences of the protease genes and splice variants are shown in 
Appendix I as SEQ ID NOs: 1-47. The amino acids encoded by these nucleic acids 
are shown in Appendix I as SEQ ID NOs:48-94. 

NUCLEIC ACIDS OF THE INVENTION 

1 0 . Protease Nucleic Acids, Portions and Variants 

Accordingly, the invention pertains to isolated nucleic acid molecules 
comprising human protease genes. The term, "protease", as used herein, refers to an 
isolated nucleic acid molecule selected from the group shown in Appendix I, and 
consisting of SEQ ID NOs:l-47, and also to a portion or fragment of the isolated 

1 5 nucleic acid molecule {e.g. , cDNA or the gene) that encodes protease polypeptide 
(e.g., a polypeptide selected from the group shown in Appendix I, and consisting of 
SEQ ID NOs:48-94). In a preferred embodiment, the isolated nucleic acid molecule 
comprises a nucleic acid molecule selected from the group consisting of SEQ ID 
NOs:l-47 or the complement of such a nucleic acid molecule. 

20 The isolated nucleic acid molecules of the present invention can be RNA, for 

example, mRNA, or DNA, such as cDNA and genomic DNA. DNA molecules can 
be double-stranded or single-stranded; single stranded RNA or DNA can be either 
the coding, or sense, strand or the non-coding, or antisense, strand. The nucleic acid 
molecule can include all or a portion of the coding sequence of the gene and can 

25 further comprise additional non-coding sequences such as introns and non-coding 3' 
and 5' sequences (including regulatory sequences, for example). Additionally, the 
nucleic acid molecule can be fused to a marker sequence, for example, a sequence 
that encodes a polypeptide to assist in isolation or purification of the polypeptide. 
Such sequences include, but are not limited to, those which encode a 
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glutathione-S -transferase (GST) fusion protein and those which encode a 
hemagglutinin A (HA) polypeptide marker from influenza. 

An "isolated" nucleic acid molecule, as used herein, is one that is separated 
from nucleic acids which normally flank the gene or nucleotide sequence (as in 
5 genomic sequences) and/or has been completely or partially purified from other 
transcribed sequences (e.g., as in an RNA library). For example, an isolated nucleic 
acid of the invention may be substantially isolated with respect to the complex 
cellular milieu in which it naturally occurs, or culture medium when produced by 
recombinant techniques, or chemical precursors or other chemicals when chemically 

1 0 synthesized. In some instances, the isolated material will form part of a composition 
(for example, a crude extract containing other substances), buffer system or reagent 
mix. In other circumstances, the material may be purified to essential homogeneity, 
for example as determined by PAGE or column chromatography such as HPLC. 
Preferably, an isolated nucleic acid molecule comprises at least about 50, 80 or 90% 

15 (on a molar basis) of all macromolecular species present. With regard to genomic 
DNA, the term "isolated" also can refer to nucleic acid molecules which are 
separated from the chromosome with which the genomic DNA is naturally 
associated. For example, the isolated nucleic acid molecule can contain less than 
about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotides which flank the 

20 nucleic acid molecule in the genomic DNA of the cell from which the nucleic acid 
molecule is derived. 

The nucleic acid molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. Thus, recombinant DNA contained in a 
vector is included in the definition of "isolated" as used herein. Also, isolated 

25 nucleic acid molecules include recombinant DNA molecules in heterologous host 
cells, as well as partially or substantially purified DNA molecules in solution. 
'Isolated" nucleic acid molecules also encompass in vivo and in vitro RNA 
transcripts of the DNA molecules of the present invention. An isolated nucleic acid 
molecule or nucleotide sequence can include a nucleic acid molecule or nucleotide 

30 sequence which is synthesized chemically or by recombinant means. Therefore, 
recombinant DNA contained in a vector are included in the definition of "isolated" 
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as used herein. Also, isolated nucleotide sequences include recombinant DNA 
molecules in heterologous organisms, as well as partially or substantially purified 
DNA molecules in solution. In vivo and in vitro RNA transcripts of the DNA 
molecules of the present invention are also encompassed by "isolated" nucleotide 
5 sequences. Such isolated nucleotide sequences are useful in the manufacture of the 
encoded polypeptide, as probes for isolating homologous sequences (e.g., from other 
mammalian species), for gene mapping (e.g., by in situ hybridization with 
chromosomes), or for detecting expression of the gene in tissue (e.g., human tissue), 
such as by Northern blot analysis. 

10 The present invention also pertains to nucleic acid molecules which are not 

necessarily found in nature but which encode a protease polypeptide (e.g., a 
polypeptide having an amino acid sequence comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs:48-94), or another splicing 
variant of a protease polypeptide or polymorphic variant thereof. Thus, for example, 

15 DNA molecules which comprise a sequence that is different from the naturally- 
occurring nucleotide sequence but which, due to the degeneracy of the genetic code, 
encode a protease polypeptide of the present invention are also the subject of this 
invention. The invention also encompasses nucleotide sequences encoding portions 
(fragments), or encoding variant polypeptides such as analogues or derivatives of a 

20 protease polypeptide. Such variants can be naturally-occurring, such as in the case 
of allelic variation or single nucleotide polymorphisms, or non-naturally-occurring, 
such as those induced by various mutagens and mutagenic processes. Intended 
variations include, but are not limited to, addition, deletion and substitution of one or 
more nucleotides which can result in conservative or non-conservative amino acid 

25 changes, including additions and deletions. Preferably the nucleotide (and/or 

resultant amino acid) changes are silent or conserved; that is, they do not alter the 
characteristics or activity of a protease polypeptide. In one preferred embodiment, 
the nucleotide sequences are fragments that comprise one or more polymorphic 
microsatellite markers. In another preferred embodiment, the nucleotide sequences 

30 are fragments that comprise one or more single nucleotide polymorphisms in a 
protease gene. 
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Other alterations of the nucleic acid molecules of the invention can include, 
for example, labeling, methylation, internucleotide modifications such as uncharged 
linkages (e.g., methyl phosphonates, phosphotriesters, phosphoami dates, 
carbamates), charged linkages (eg., phosphorothioates, phosphorodithioates), 
5 pendent moieties (e.g., polypeptides), intercalators (e.g., acridine, psoralen), 
chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids). 
Also included are synthetic molecules that mimic nucleic acid molecules in the 
ability to bind to a designated sequences via hydrogen bonding and other chemical 
interactions. Such molecules include, for example, those in which peptide linkages 

1 0 substitute for phosphate linkages in the backbone of the molecule. 

The invention also pertains to nucleic acid molecules which hybridize under 
high stringency hybridization conditions, such as for selective hybridization, to a 
nucleotide sequence described herein (e.g., nucleic acid molecules which specifically 
hybridize to a nucleotide sequence encoding polypeptides described herein, and, 

1 5 optionally, have an activity of the polypeptide). In one embodiment, the invention 
includes variants described herein which hybridize under high stringency 
hybridization conditions (e.g., for selective hybridization) to a nucleotide sequence 
comprising a nucleotide sequence selected from the group consisting of SEQ ID 
NOs:l-47. In another embodiment, the invention includes variants described herein 

20 which hybridize under high stringency hybridization conditions (e.g. , for selective 
hybridization) to a nucleotide sequence encoding an amino acid sequence selected 
from the group consisting of SEQ ID NOs:48-94 or a polymorphic variant thereof. 
In a preferred embodiment, the variant which hybridizes under high stringency 
hybridizations has an activity of a protease. 

25 Such nucleic acid molecules can be detected and/or isolated by specific 

hybridization (e.g., trader high stringency conditions). "Specific hybridization," as 
used herein, refers to the ability of a first nucleic acid to hybridize to a second 
nucleic acid in a manner such that the first nucleic acid does not hybridize to any 
nucleic acid other than to the second nucleic acid (e.g., when the first nucleic acid 

30 has a higher similarity to the second nucleic acid than to any other nucleic acid in a 
sample wherein the hybridization is to be performed). "Stringency conditions" for 
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hybridization is a term of art which refers to the incubation and wash conditions, 
e.g., conditions of temperature and buffer concentration, which permit hybridization 
of a particular nucleic acid to a second nucleic acid; the first nucleic acid may be 
perfectly (i.e., 100%) complementary to the second, or the first and second may 
5 share some degree of complementarity which is less than perfect {e.g. , 70%, 75%, 
85%, 95%). For example, certain high stringency conditions can be used which 
distinguish perfectly complementary nucleic acids from those of less 
complementarity. "High stringency conditions", "moderate stringency conditions" 
and "low stringency conditions" for nucleic acid hybridizations are explained on 

10 pages 2.10.1-2.10.16 and pages 6.3.1-6.3.6 in Current Protocols in Molecular, 
Biology (Ausubel, F.M. et al, "Current Protocols in Molecular Biology", John 
Wiley & Sons, (1998), the entire teachings of which are incorporated by reference 
herein). The exact conditions which determine the stringency of hybridization 
depend not only on ionic strength {e.g., 0.2X SSC, 0.1X SSC), temperature {e.g., 

15 room temperature, 42°C, 68°C) and the concentration of destabilizing agents such as 
formamide or denaturing agents such as SDS, but also on factors such as the length 
of the nucleic acid sequence, base composition, percent mismatch between 
hybridizing sequences and the frequency of occurrence of subsets of that sequence 
within other non-identical sequences. Thus, equivalent conditions can be 

20 determined by varying one or more of these parameters while maintaining a similar 
degree of identity or similarity between the two nucleic acid molecules. Typically, 
conditions are used such that sequences at least about 60%, at least about 70%, at 
least about 80%, at least about 90% or at least about 95% or more identical to each 
other remain hybridized to one another. By varying hybridization conditions from a 

25 level of stringency at which no hybridization occurs to a level at which hybridization 
is first observed, conditions which will allow a given sequence to hybridize {e.g., 
selectively) with the most similar sequences in the sample can be determined. 

Exemplary conditions are described in Krause, M.H. and S.A. Aaronson, 
Methods in Enzymology 200:546-556 (1991), and in, Ausubel, et al, "Current 

30 Protocols in Molecular Biology 77 , John Wiley & Sons, (1998), which describes the 
determination of washing conditions for moderate or low stringency conditions. 
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Washing is the step in which conditions are usually set so as to determine a 
minimum level of complementarity of the hybrids. Generally, starting from the 
lowest temperature at which only homologous hybridization occurs, each °C by 
which the final wash temperature is reduced (holding SSC concentration constant) 
5 allows an increase by 1% in the maximum extent of mismatching among the 

sequences that hybridize. Generally, doubling the concentration of SSC results in an 
increase in T m of ~ 17°C. Using these guidelines, the washing temperature can be 
determined empirically for high, moderate or low stringency, depending on the level 
of mismatch sought. 

1 0 For example, a low stringency wash can comprise washing in a solution 

containing 0.2X SSC/0,1% SDS for 10 minutes at room temperature; a moderate 
stringency wash can comprise washing in aprewarmed solution (42°C) solution 
containing 0.2X SSC/0.1% SDS for 15 minutes at 42°C; and a high stringency wash 
can comprise washing in prewarmed (68°C) solution containing 0.1X SSC/0.1%SDS 

15 for 15 minutes at 68°C. Furthermore, washes can be performed repeatedly or 

sequentially to obtain a desired result as known in the art. Equivalent conditions can 
be determined by varying one or more of the parameters given as an example, as 
known in the art, while maintaining a similar degree of identity or similarity between 
the target nucleic acid molecule and the primer or probe used. 

20 The percent homology or identity of two nucleotide or amino acid sequences 

can be determined by aligning the sequences for optimal comparison purposes (e.g., 
gaps can be introduced in the sequence of a first sequence for optimal alignment). 
The nucleotides or amino acids at corresponding positions are then compared, and 
the percent identity between the two sequences is a function of the number of 

25 /identical positions shared by the sequences {i.e., % identity = # of identical 

positions/total # of positions x 100). When a position in one sequence is occupied by 
the same nucleotide or amino acid residue as the corresponding position in the other 
sequence, then the molecules are homologous at that position. As used herein, 
nucleic acid or amino acid "homology" is equivalent to nucleic acid or amino acid 

30 "identity". In certain embodiments, the length of a sequence aligned for comparison 
purposes is at least 30%, for example, at least 40%, in certain embodiments at least 
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60%, and in other embodiments at least 70%, 80%, 90% or 95% of the length of the 
reference sequence. The actual comparison of the two sequences can be 
accomplished by well-known methods, for example, using a mathematical 
algorithm. A preferred, non-limiting example of such a mathematical algorithm is 
5 described in Karlin et aL, Proc. Natl. Acad. Sci. USA 90:5873-5877 (1993). Such an 
algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) as 
described in Altschul et aL 9 Nucleic Acids Res. 25:389-3402 (1997). When utilizing 
BLAST and Gapped BLAST programs, the default parameters of the respective 
programs (e.g., NBLAST) can be used. In one embodiment, parameters for 
10 sequence comparison can be set at score=100, wordlength=12, or can be varied (e.g., 
W=5 or W=20). 

Another preferred, non-limiting example of a mathematical algorithm 
utilized for the comparison of sequences is the algorithm of Myers and Miller, 
CABIOS 4(1):1 1-17 (1988). Such an algorithm is incorporated into the ALIGN 

1 5 program (version 2.0) which is part of the GCG sequence alignment software 

package. When utilizing the ALIGN program for comparing amino acid sequences, 
a P AMI 20 weight residue table, a gap length penalty of 12 , and a gap penalty of 4 
can be used. Additional algorithms for sequence analysis are known in the art and 
include ADVANCE and ADAM as described in Torellis and Robotti, Comput. Appl 

20 Biosci. 10:3-5 (1994); and FASTA described in Pearson and Lipman, Proc. Natl 
Acad. Sci. USA 85:2444-8 (1988). 

In another embodiment, the percent identity between two amino acid 
sequences can be accomplished using the GAP program in the GCG software 
package (Accelrys, Cambridge, UK) using either a BLOSUM63 matrix or a 

25 PAM250 matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 
4. In yet another embodiment, the percent identity between two nucleic acid 
sequences can be accomplished using the GAP program in the GCG software 
package, using a gap weight of 50 and a length weigjit of 3. 

The present invention also provides isolated nucleic acid molecules that 

30 contain a fragment or portion that hybridizes under highly stringent conditions to a 
nucleotide sequence comprising a nucleotide sequence selected from the group 
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consisting of SEQ ID NOs: 1-47, or the complement of such a sequence, and also 
provides isolated nucleic acid molecules that contain a fragment or portion that 
hybridizes under highly stringent conditions to a nucleotide sequence encoding an 
amino acid sequence selected SEQ ID NOs:48-94, or polymorphic variant thereof. 
5 The nucleic acid fragments of the invention are at least about 15, preferably at least 
about 18, 20, 23 or 25 nucleotides, and can be 30, 40, 50, 100, 200 or more 
nucleotides in length. Longer fragments, for example, 30 or more nucleotides in 
length, which encode antigenic polypeptides described herein are particularly useful, 
such as for the generation of antibodies as described below. 

1 0 Probes and Primes 

In a related aspect, the nucleic acid fragments of the invention are used as 
probes or primers in assays such as those described herein. "Probes" or "primers" 
are oligonucleotides that hybridize in a base-specific manner to a complementary 
strand of nucleic acid molecules. Such probes and primers include polypeptide 

15 nucleic acids, as described in Nielsen et al, Science 254:1497-1500 (1991). 

A probe or primer comprises a region of nucleotide sequence that hybridizes 
to at least about 15, typically about 20-25, and more typically about 40, 50 or 75, 
consecutive nucleotides of a nucleic acid molecule comprising a contiguous 
nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-47, or the 

20 complement of such a sequence, or a sequence encoding an amino acid sequence 
selected from SEQ ID NOs:48-94, or polymorphic variant thereof. In preferred 
embodiments, a probe or primer comprises 100 or fewer nucleotides, preferably 
from 6 to 50 nucleotides, preferably from 12 to 30 nucleotides. In other 
embodiments, the probe or primer is at least 70% identical to the contiguous 

25 nucleotide sequence or to the complement of the contiguous nucleotide sequence, 
preferably at least 80% identical, more preferably at least 90% identical, even more 
preferably at least 95% identical, or even capable of selectively hybridizing to the 
contiguous nucleotide sequence or to the complement of the contiguous nucleotide 
sequence. Often, the probe or primer further comprises a label, e.g., radioisotope, 

30 fluorescent compound, enzyme, or enzyme co-factor. 
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The nucleic acid molecules of the invention such as those described above 
can be identified and isolated using standard molecular biology techniques and the 
sequence information provided herein. For example, nucleic acid molecules can be 
amplified and isolated by the polymerase chain reaction using synthetic 

5 oligonucleotide primers designed based on one or more of the sequences selected 
. from the group consisting of SEQ ID NOs:l-47, or the complement of such a 
sequence, or designed based on nucleotides based on sequences encoding one or 
more of the amino acid sequences provided herein. See generally PCR Technology: 
Principles and Applications forDNA Amplification (ed. H.A. Erlich, Freeman Press, 

1 0 NY, NY, 1 992); PCR Protocols: A Guide to Methods and Applications (Eds. Innis et 
al, Academic Press, San Diego, CA, 1990); Mattila et al. 9 Nucl Acids Res. 19:4967 
(1991); Eckert et al, PCR Methods and Applications 1:17 (1991); PCR (eds. 
McPherson et al, IRL Press, Oxford); and U.S. Patent 4,683,202. The nucleic acid 
molecules can be amplified using cDNA, mRNA or genomic DNA as a template, 

15 cloned into an appropriate vector and characterized by DNA sequence analysis. 

Other suitable amplification methods include the ligase chain reaction (LCR) 
(see Wu and Wallace, Genomics 4:560 (1989), Landegren et al, Science 241 : 1077 

(1988) , transcription amplification (Kwoh et al, Proc. Natl Acad. Set USA 86:1173 

i 

(1989) ), and self-sustained sequence replication (Guatelli et al, Proc. Nat. Acad. 
20 Set USA 87:1874 (1990)) and nucleic acid based sequence amplification (NASBA). 

The latter two amplification methods involve isothermal reactions based on 
isothermal transcription, which produce both single stranded RNA (ssRNA) and 
double stranded DNA (dsDNA) as the amplification products in a ratio of about 30 
or 100 to 1, respectively. 

25 The amplified DNA can be labelled, for example radiolabelled, and used as a 

probe for screening a cDNA library derived from human cells, mRNA in zap 
express, Z3PLOX or other suitable vector. Corresponding clones can be isolated, 
DNA can obtained following in vivo excision, and the cloned insert can be 
sequenced in either or both orientations by art recognized methods to identify the 

30 correct reading frame encoding a polypeptide of the appropriate molecular weight. 
For example, the direct analysis of the nucleotide sequence of nucleic acid molecules 
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of the present invention can be accomplished using well-known methods that are 
commercially available. See, for example, Sambrook et ah, Molecular Cloning, A 
Laboratory Manual (2nd Ed., CSHP, New York 1989); Zyskind et aL, Recombinant 
DNA Laboratory Manual, (Acad. Press, 1988)). Additionally, fluorescence methods 
5 are also available for analyzing nucleic acids (Chen et aL, Genome Res. 9, 492 
(1999)) and polypeptides. Using these or similar methods, the polypeptide and the 
DNA encoding the polypeptide can be isolated, sequenced and further characterized. 

Antisense nucleic acid molecules of the invention can be designed using the 
nucleotide sequences of one or more of SEQ ID NOs:l-47 and/or the complement of 

10 one or more of SEQ ID NOs:l-47, and/or a portion of one or more of SEQ ID 
NOs:l-47, or the complement of one or more of SEQ ID NOs:l-47 and/or a 
sequence encoding the amino acid sequences of one or more of SEQ ID NOs:48-94, 
or encoding a portion of one or more of SEQ ID NOs:48-94, and constructed using 
chemical synthesis and enzymatic ligation reactions using procedures known in the 

15 art. For example, an antisense nucleic acid molecule {e.g., an antisense 

oligonucleotide) can be chemically synthesized using naturally occurring nucleotides 
or variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 

20 substituted nucleotides can be used. Alternatively, the antisense nucleic acid 
molecule can be produced biologically using an expression vector into which a 
nucleic acid molecule has been subcloned in an antisense orientation {i.e., RNA 
transcribed from the inserted nucleic acid molecule will be of an antisense 
orientation to a target nucleic acid of interest). 

25 In general, the isolated nucleic acid sequences of the invention can be used as 

molecular weight markers on Southern gels, and as chromosome markers which are 
labeled to map related gene positions. The nucleic acid sequences can also be used 
to compare with endogenous DNA sequences in patients to identify one or more of 
the disorders described above, and as probes, such as to hybridize and discover 

30 related DNA sequences or to subtract out known sequences from a sample. The 
nucleic acid sequences can further be used to derive primers for genetic 
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fingerprinting, to raise anti-polypeptide antibodies using DNA immunization 
techniques, and as an antigen to raise anti-DNA antibodies or elicit immune 
responses. Portions or fragments of the nucleotide sequences identified herein (and 
the corresponding complete gene sequences) can be used in numerous ways as 
5 polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome; and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 
typing); and (iii) aid in forensic identification of a biological sample. Additionally, 
the nucleotide sequences of the invention can be used to identify and express 

10 recombinant polypeptides for analysis, characterization or therapeutic use, or as 
markers for tissues in which the corresponding polypeptide is expressed, either 
constitutively, during tissue differentiation, or in diseased states. The nucleic acid 
sequences can additionally be used as reagents in the screening and/or diagnostic 
assays described herein, and can also be included as components of kits (e.g., 

15 reagent kits) for use in the screening and/or diagnostic assays described herein. 

Vectors 

Another aspect of the invention pertains to nucleic acid constructs containing 
a nucleic acid molecule selected from the group consisting of SEQ ID NOs:l-47 and 
the complements -thereof (or a portion thereof). Yet another aspect of the invention 

20 pertains to nucleic acid constructs containing a nucleic acid molecule encoding an 
amino acid sequence of SEQ ID NOs:48-94 or polymorphic variant thereof. The 
constructs comprise a vector {e.g., an expression vector) into which a sequence of 
the invention has been inserted in a sense or antisense orientation. As used herein, 
the term 'Vector" refers to a nucleic acid molecule capable of transporting another 

25 nucleic acid to which it has been linked. One type of vector is a "plasmid", which 
refers to a circular double stranded DNA loop into which additional DNA segments 
can be ligated. Another type of vector is a viral vector, wherein additional DNA 
segments can be ligated into the viral genome. Certain vectors are capable of 
autonomous replication in a host cell into which they are introduced (e.g., bacterial 

30 vectors having a bacterial origin of replication and episomal mammalian vectors). 
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Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated 
along with the host genome. Moreover, certain vectors, expression vectors, are 
capable of directing the expression of genes to which they are operably linked. In 
5 general, expression vectors of utility in recombinant DNA techniques are often in the 
form of plasmids. However, the invention is intended to include such other forms of 
expression vectors, such as viral vectors (e.g., "replication defective retroviruses, 
adenoviruses and adeno-associated viruses) that serve equivalent functions. 

Preferred recombinant expression vectors of the invention comprise a nucleic 
10 acid molecule of the invention in a form suitable for expression of the nucleic acid 
molecule in a host cell. This means that the recombinant expression vectors include 
- one or more regulatory sequences, selected on the basis of the host cells to be used 
for expression, which is operably linked to the nucleic acid sequence to be 
expressed. Within a recombinant expression vector, "operably linked" or 
15 "operatively linked" is intended to mean that the nucleotide sequence of interest is 
linked to the regulatory sequence(s) in a manner which allows for expression of the 
nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host 
cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to include promoters, enhancers and other expression control 
20 elements (e.g., polyadenylation signals). Such regulatory sequences are described, 
for example, in Goeddel, "Gene Expression Technology", Methods in Enzymology 
1 85, Academic Press, San Diego, CA (1990). Regulatory sequences include those 
which direct constitutive expression of a nucleotide sequence in many types of host 
cell and those which direct expression of the nucleotide sequence only in certain host 
25 cells (e.g. , tissue-specific regulatory sequences). It will be appreciated by those 

skilled in the art that the design of the expression vector can depend on such factors 
as the choice of the host cell to be transformed and the level of expression of 
polypeptide desired. The expression vectors of the invention can be introduced into 
host cells to thereby produce polypeptides, including fusion polypeptides, encoded 
30 by nucleic acid molecules as described herein. 
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The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g., 
bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), 
yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
5 supra. Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

Another aspect of the invention pertains to host cells into which a 
recombinant expression vector of the invention has been introduced. The terms 

1 0 "host cell" and "recombinant host cell" are used interchangeably herein. It is 

understood that such terms refer not only to the particular subject cell but also to the 
progeny or potential progeny of such a cell. Because certain modifications may 
occur in succeeding generations due to either mutation or environmental influences, 
such progeny may not, in fact, be identical to the parent cell, but are still included 

1 5 within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic 
acid molecule of the invention can be expressed in bacterial cells (e.g., E. coli), 
insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) 
or COS cells). Other suitable host cells are known to those skilled in the art. 

20 Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of 
art-recognized techniques for introducing a foreign nucleic acid molecule (e.g., 
DNA) into a host cell, including calcium phosphate or calcium chloride 

25 co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 

electroporation. Suitable methods for transforming or transfecting host cells can be 
found in Sambrook, et al. (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 
the expression vector and transfection technique used, only a small fraction of cells 

30 may integrate the foreign DNA into their genome. In order to identify and select 
these integrants, a gene that encodes a selectable marker (e.g., for resistance to 
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antibiotics) is generally introduced into the host cells along with the gene of interest. 
Preferred selectable markers include those that confer resistance to drugs, such as 
G41 8, hygromycin and methotrexate. Nucleic acid molecules encoding a selectable 
marker can be introduced into a host cell on the same vector as the nucleic acid 
5 molecule of the invention or can be introduced on a separate vector. Cells stably 
transfected with the introduced nucleic acid molecule can be identified by drug 
selection (e.g., cells that have incorporated the selectable marker gene will survive, 
while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 

10 culture, can be used to produce (i.e., express) a polypeptide of the invention. 

Accordingly, the invention further provides methods for producing a polypeptide 
using the host cells of the invention. In one embodiment, the method comprises 
culturing the host cell of invention (into which a recombinant expression vector 
encoding a polypeptide of the invention has been introduced) in a suitable medium 

1 5 such that the polypeptide is produced. In another embodiment, the method further 
comprises isolating the polypeptide from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman 
transgenic animals. For example, in one embodiment, a host cell of the invention is 
a fertilized oocyte or an embryonic stem cell into which a nucleic acid molecule of 

20 the invention has been introduced (e.g., an exogenous protease gene, or an 

exogenous nucleic acid encoding a protease polypeptide). Such host cells can then 
be used to create non-human transgenic animals in which exogenous nucleotide 
sequences have been introduced into the genome or homologous recombinant 
animals in which endogenous nucleotide sequences have been altered. Such animals 

25 are useful for studying the function and/or activity of the nucleotide sequence and 
polypeptide encoded by the sequence and for identifying and/or evaluating 
modulators of their activity. As used herein, a "transgenic animal" is a non-human 
animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in 
which one or more of the cells of the animal includes a transgene. Other examples 

30 of transgenic animals include non-human primates, sheep, dogs, cows, goats, 

chickens and amphibians. A transgene is exogenous DNA which is integrated into 
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the genome of a cell from which a transgenic animal develops and which remains in 
the genome of the mature animal, thereby directing the expression of an encoded 
gene product in one or more cell types or tissues of the transgenic animal. As used 
herein, an "homologous recombinant animal" is a non-human animal, preferably a 
5 mammal, more preferably a mouse, in which an endogenous gene has been altered 
by homologous recombination between the endogenous gene and an exogenous. 
DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the 
animal, prior to development of the animal. 

Methods for generating transgenic animals via embryo manipulation and 

10 microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, 
U.S. Pat. No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Methods for 
constructing homologous recombination vectors and homologous recombinant 

15 animals are described further in Bradley, Current Opinion in BioTechnology 

2:823-829 (1991) and in PCT Publication Nos. WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. Clones of the non-human transgenic animals described 
herein can also be produced according to the methods described in Wilmut et al, 
Nature 385:810-813 (1997) and PCT Publication Nos. WO 97/07668 and WO 

20 97/07669. 

POLYPEPTIDES OF THE INVENTION 

The present invention also pertains to isolated polypeptides encoded by 
proteases ("protease polypeptides") and fragments and variants thereof, as well as 
polypeptides encoded by nucleotide sequences described herein {e.g., other splicing 

25 variants). The term "polypeptide" refers to a polymer of amino acids, and not to a 
specific length; thus, peptides, oligopeptides and proteins are included within the 
definition of a polypeptide. As used herein, a polypeptide is said to be "isolated" or 
"purified" when it is substantially free of cellular material when it is isolated from 
recombinant and non-recombinant cells, or free of chemical precursors or other 

30 chemicals when it is chemically synthesized. A polypeptide, however, can be joined 
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to another polypeptide with which it is not normally associated in a cell (e.g., in a 
"fusion protein") and still be "isolated" or "purified." 

The polypeptides of the invention can be purified to homogeneity. It is 
understood, however, that preparations in which the polypeptide is not purified to 
5 homogeneity are useful. The critical feature is that the preparation allows for the 
desired function of the polypeptide, even in the presence of considerable amounts of 
other components. Thus, the invention encompasses various degrees of purity. In 
one embodiment, the language "substantially free of cellular material" includes 
preparations of the polypeptide having less than about 30% (by dry weight) other 

10 proteins (i.e. , contaminating protein), less than about 20% other proteins, less than 
about 10% other proteins, or less than about 5% other proteins. 

When a polypeptide is recombinantly produced, it can also be substantially 
free of culture medium, i.e., culture medium represents less than about 20%, less 
than about 10%, or less than about 5% of the volume of the polypeptide preparation. 

15 The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the polypeptide in which it is separated from chemical precursors or 
other chemicals that are involved in its synthesis. In one embodiment, the language 
"substantially free of chemical precursors or other chemicals" includes preparations 
of the polypeptide having less than about 30% (by dry weight) chemical precursors 

20 or other chemicals, less than about 20% chemical precursors or other chemicals, less 
than about 10% chemical precursors or other chemicals, or less than about 5% 
chemical precursors or other chemicals. 

In one embodiment, a polypeptide of the invention comprises an amino acid 
sequence encoded by a nucleic acid molecule comprising a nucleotide sequence 

25 selected from the group consisting of SEQ ID NOs:l-47, or the complement of such 
a nucleic acid, or portions thereof, e.g., SEQ ID NO:48-94, or a portion or 
polymorphic variant thereof. However, the polypeptides of the invention also 
encompass fragment and sequence variants. Variants include a substantially 
homologous polypeptide encoded by the same genetic locus in an organism, z.e., an 

30 allelic variant, as well as other splicing variants. Variants also encompass 

polypeptides derived from other genetic loci in an organism, but having substantial 
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homology to a polypeptide encoded by a nucleic acid molecule comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs:l-47, or a 
complement of such a sequence, or portions thereof, or having substantial homology 
to a polypeptide encoded by a nucleic acid molecule comprising a nucleotide 
5 sequence selected from the group consisting of nucleotide sequences encoding SEQ 
ID NOs:48-94, or polymorphic variants thereof. Variants also include polypeptides 
substantially homologous or identical to these polypeptides but derived from another 
organism, i.e., an ortholog. Variants also include polypeptides that are substantially 
homologous or identical to these polypeptides that are produced by chemical 
1 0 synthesis. Variants also include polypeptides that are substantially homologous or 
identical to these polypeptides that are produced by recombinant methods. 

As used herein, two polypeptides (or a region of the polypeptides) are 
substantially homologous or identical when the amino acid sequences are at least 
about 45-55%, in certain embodiments at least about 70-75%, in other embodiments 
15 at least about 80-85%, and in other embodiments greater than about 90% or more 
homologous or identical. A substantially homologous amino acid sequence, 
according to the present invention, will be encoded by a nucleic acid molecule 
hybridizing to one or more of SEQ ID NOs: 1 -47, or portion thereof, under stringent 
conditions as more particularly described above, or will be encoded by a nucleic acid 
20 molecule hybridizing to a nucleic acid sequence encoding one of SEQ ID NOs:48- 
94, a portion thereof or polymorphic variant thereof, under stringent conditions as 
more particularly described thereof. 

The invention also encompasses polypeptides having a lower degree of 
identity but having sufficient similarity so as to perform one or more of the same 
25 functions performed by a polypeptide encoded by a nucleic acid molecule of the 
invention. Similarity is determined by conserved amino acid substitution. Such 
substitutions are those that substitute a given amino acid in a polypeptide by another 
amino acid of like characteristics. Conservative substitutions are likely to be 
phenotypically silent Typically seen as conservative substitutions are the 
30 replacements, one for another, among the aliphatic amino acids Ala, Val, Leu and 

He; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues 
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Asp and Glu, substitution between the amide residues Asn and Gin, exchange of the 
basic residues Lys and Arg and replacements among the aromatic residues Phe and 
Tyr. Guidance concerning which amino acid changes are likely to be phenotypically 
silent are found in Bowie et al 9 Science 247:1306-1310 (1990). 
5 A variant polypeptide can differ in amino acid sequence by one or more 

substitutions, deletions, insertions, inversions, fusions, and truncations or a 
combination of any of these. Further, variant polypeptides can be fully functional or 
can lack function in one or more activities. Fully functional variants typically 
contain only conservative variation or variation in non-critical residues or in 

10 non-critical regions. Functional variants can also contain substitution of similar - 
amino acids that result in no change or an insignificant change in function. 
Alternatively, such substitutions may positively or negatively affect function to some 
degree. Non- functional variants typically contain one or more non-conservative 
amino acid substitutions, deletions, insertions, inversions, or truncation or a 

15 substitution, insertion, inversion, or deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods 
known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis 
(Cunningham et al 9 Science 244:1081-1085 (1989)). The latter procedure 
introduces single alanine mutations at every residue in the molecule. The resulting 

20 mutant molecules are then tested for biological activity in vifro, or in vitro 

proliferative activity. Sites that are critical for polypeptide activity can also be 
determined by structural analysis such as crystallization, nuclear magnetic resonance 
or photoaffinity labeling (Smith et al y J. Mol Biol 224:899-904 (1992); de Vos et 
al, Science 255:306-312 (1992)). 

25 The invention also includes polypeptide fragments of the polypeptides of the 

invention. Fragments can be derived from a polypeptide encoded by a nucleic acid 
molecule comprising one of SEQ ID NOs:l-47, or a complement of such a nucleic 
acid (e.g., SEQ ID NOs:48-94, or other variants). However, the invention also 
encompasses fragments of the variants of the polypeptides described herein. As 

30 used herein, a fragment comprises at least 6 contiguous amino acids. Useful 

fragments include those that retain one or more of the biological activities of the 
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polypeptide as well as fragments that can be used as an immunogen to generate 
polypeptide-specific antibodies. 

Biologically active fragments (peptides which are, for example, 6, 9, 12, 15, 
16, 20, 30, 35, 36, 37, 38, 39, 40, 50, 100 or more amino acids in length) can 
5 comprise a domain, segment, or motif that has been identified by analysis of the 
polypeptide sequence using well-known methods, e.g., signal peptides, extracellular 
domains, one or more transmembrane segments or loops, ligand binding regions, 
zinc finger domains, DNA binding domains, acylation sites, glycosylation sites, or 
phosphorylation sites. 

10 Fragments can be discrete (not fused to other amino acids or polypeptides) or 

can be within a larger polypeptide. Further, several fragments can be comprised 
within a single larger polypeptide. In one embodiment a fragment designed for 
expression in a host can have heterologous pre- and pro-polypeptide regions fused to 
the amino terminus of the polypeptide fragment and an additional region fused to the 

1 5 carboxyl terminus of the fragment. 

The invention thus provides chimeric or fusion polypeptides. These 
comprise a polypeptide of the invention operatively linked to a heterologous protein 
or polypeptide having an amino acid sequence not substantially homologous to the 
polypeptide. "Operatively linked" indicates that the polypeptide and the 

20 heterologous protein are fused in-frame. The heterologous protein can be fused to 
the N-terminus or C-terminus of the polypeptide. In one embodiment the fusion 
polypeptide does not affect function of the polypeptide per se. For example, the 
fusion polypeptide can be a GST-fusion polypeptide in which the polypeptide 
sequences are fused to the C-terminus of the GST sequences. Other types of fusion 

25 polypeptides include, but are not limited to, enzymatic fusion polypeptides, for 
example P-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions 
and Ig fusions. Such fusion polypeptides, particularly poly-His fusions, can 
facilitate the purification of recombinant polypeptide. In certain host cells (e.g., 
mammalian host cells), expression and/or secretion of a polypeptide can be 

30 increased by using a heterologous signal sequence. Therefore, in another 
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embodiment, the fusion polypeptide contains a heterologous signal sequence at its 
N- terminus. 

EP A 0 464 533 discloses fusion proteins comprising various portions of 
immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and 
5 thus results, for example, in improved pharmacokinetic properties (EP A 0 232 262). 
In drug discovery, for example, human protems have been fused with Fc portions for 
the purpose of high-throughput screening assays to identify antagonists. Bennett et 
al y Journal of Molecular Recognition, 5:52-58 (1995) and Johanson et al> The 
Journal of Biological Chemistry, 270,16:9459-9471 (1995). Thus, this invention 

10 also encompasses soluble fusion polypeptides containing a polypeptide of the 
invention and various portions of the constant regions of heavy or light chains of 
immunoglobulins of various subclass (IgG, IgM, IgA, IgE). 

A chimeric or fusion polypeptide can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide 

15 sequences are ligated together in-frame in accordance with conventional techniques. 
In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification 
of nucleic acid fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive nucleic acid fragments which 

20 can subsequently be annealed and re-amplified to generate a chimeric nucleic acid 
sequence (see Ausubel et al, Current Protocols in Molecular Biology, 1992). 
Moreover, many expression vectors are commercially available that already encode a 
fusion moiety {e.g., a GST protein). A nucleic acid molecule encoding a polypeptide 
of the invention can be cloned into such an expression vector such that the fusion 

25 moiety is linked in- frame to the polypeptide. 

The isolated polypeptide can be purified from cells that naturally express it, 
purified from cells that have been altered to express it (recombinant), or synthesized 
using known protein synthesis methods. In one embodiment, the polypeptide is 
produced by recombinant DNA techniques. For example, a nucleic acid molecule 

30 encoding the polypeptide is cloned into an expression vector, the expression vector 
introduced into a host cell and the polypeptide expressed in the host cell. The 
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polypeptide can then be isolated from the cells by an appropriate purification scheme 
using standard protein purification techniques. 

In general, polypeptides of the present invention can be used as a molecular 
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
5 using art-recognized methods. The polypeptides of the present invention can be 
used to raise antibodies or to elicit an immune response. The polypeptides can also 
be used as a reagent, e.g. , a labeled reagent, in assays to quantitatively determine 
levels of the polypeptide or a molecule to which it binds (e.g., a ligand) in biological 
fluids. The polypeptides can also be used as markers for cells or tissues in which the 
10 corresponding polypeptide is preferentially expressed, either constitutively, during 
tissue differentiation, or in a diseased state. The polypeptides can be used to isolate 
a corresponding binding agent, e.g., ligand, such as, for example, in an interaction 
trap assay, and to screen for peptide or small molecule antagonists or agonists of the 
binding interaction. 

1 5 ANTIBODIES OF THE INVENTION 

Polyclonal and/or monoclonal antibodies that specifically bind one form of 
the gene product but not to the other form of the gene product are also provided. 
Antibodies are also provided that bind a portion of either the variant or the reference 
gene product that contains the polymorphic site or sites. The invention provides 

20 antibodies to the polypeptides and polypeptide fragments of the invention, e.g., 

having an amino acid sequence of one of SEQ ID NOs:48-94 or a portion thereof, or 
having an amino acid sequence encoded by a nucleic acid molecule comprising all or 
a portion of SEQ ID NOs: 1-47, or a complement or another variant or portion . 
thereof. The term "antibody as used herein refers to immunoglobulin molecules 

25 and immunologically active portions of immunoglobulin molecules, i.e., molecules 
that contain an antigen binding site that specifically binds an antigen. A molecule 
that specifically binds to a polypeptide of the invention is a molecule that binds to 
that polypeptide or a fragment thereof, but does not substantially bind other 
molecules in a sample, e.g., 2l biological sample, which naturally contains the 

30 polypeptide. Examples of immunologically active portions of immunoglobulin 
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molecules include F(ab) and F(ab') 2 fragments which can be generated by treating 
the antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind to a polypeptide of the invention. The term 
"monoclonal antibody" or "monoclonal antibody composition", as used herein, 
5 refers to a population of antibody molecules that contain only one species of an 
antigen binding site capable of immunoreacting with a particular epitope of a 
polypeptide of the invention. A monoclonal antibody composition thus typically 
displays a single binding affinity for a particular polypeptide of the invention with 
which it immunoreacts. 
10 Polyclonal antibodies can be prepared as described above by immunizing a 

suitable subject with a desired immunogen, e.g., polypeptide of the invention or 
fragment thereof. The antibody titer in the immunized subject can be monitored 
over time by standard techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized polypeptide. If desired, the antibody molecules 
1 5 directed against the polypeptide can be isolated from the mammal (e.g., from the 
blood) and further purified by well-known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an appropriate time after 
immunization, e.g., when the antibody titers are highest, antibody-producing cells 
can be obtained from the subject and used to prepare monoclonal antibodies by 
20 standard techniques, such as the hybridoma technique originally described by Kohler 
and Milstein, Nature 256:495-497 (1975), the human B cell hybridoma technique 
(Kozbor et al, Immunol Today 4:72 (1983)), the EBV-hybridoma technique (Cole 
et al. , Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 1985, Inc., pp. 
77-96) or trioma techniques. The technology for producing hybridomas is well 
25 known (see generally Current Protocols in Immunology (1994) Coligan et al (eds.) 
John Wiley & Sons, Inc., New York, NY). Briefly, an immortal cell line (typically a 
myeloma) is fused to lymphocytes (typically splenocytes) from a mammal 
immunized with an immunogen as described above, and the culture supematants of 
the resulting hybridoma cells are screened to identify a hybridoma producing a 
30 monoclonal antibody that binds a polypeptide of the invention. 
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Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating a monoclonal 
antibody to a polypeptide of the invention (see, e.g., Current Protocols in 
Immunology, supra; Galfre et al, Nature 266:55052 (1977); R.H. Kenneth, in 
5 Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum 
Publishing Corp., New York, New York (1980); and Lerner, Yale J. Biol Med. 
54:387-402 (1981). Moreover, the ordinarily skilled worker will appreciate that 
there are many variations of such methods that also would be useful. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 

10 monoclonal antibody to a polypeptide of the invention can be identified and isolated 
by screening a recombinant combinatorial immunoglobulin library {e.g., an antibody 
phage display library) with the polypeptide to thereby isolate immunoglobulin 
library members that bind the polypeptide. Kits for generating and screening phage 
display libraries are commercially available {e.g., the Pharmacia Recombinant Phage 

1 5 Antibody System, Catalog No. 27-9400-0 1 ; and the Stratagene SurJZAP™ Phage 
Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening antibody display library 
can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 92/20791; 

20 PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT 
Publication No. WO 90/02809; Fuchs et al, Bio/Technology 9:1370-1372 (1991); 
Hay et al, Hum. Antibod. Hybridomas 3:81-85 (1992); Huse et al, Science 
246:1275-1281 (1989); Griffiths et al, EMBO J. 12:725-734(1993). 

25 Additionally, recombinant antibodies, such as chimeric and humanized 

monoclonal antibodies, comprising both human and non-human portions, which can 
be made using standard recombinant DNA techniques, are within the scope of the 
invention. Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art. 

30 In general, antibodies of the invention {e.g., a monoclonal antibody) can be 

used to isolate a polypeptide of the invention by standard techniques, such as affinity 
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chromatography or immunoprecipitation. A polypeptide-specific antibody can 
facilitate the purification of natural polypeptide from cells and of recombinantly 
produced polypeptide expressed in host cells, Moreover, an antibody specific for a 
polypeptide of the invention can be used to detect the polypeptide {e.g., in a cellular 
5 lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and 
pattern of expression of the polypeptide. Antibodies can be used diagnostically to 
monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for 
example, determine the efficacy of a given treatment regimen. Detection can be 
facilitated by coupling the antibody to a detectable substance. Examples of 

10 detectable substances include various enzymes, prosthetic groups, fluorescent 
materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 
phosphatase, P-galactosidase, or acetylcholinesterase; examples of suitable 
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples 

15 of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
bioluminescent materials include luciferase, luciferin, and aequoiin, and examples of 
suitable radioactive material include 125 I, 131 I, 35 S or 3 H. 

20 DIAGNOSTIC AND SCREENING ASSAYS OF THE INVENTION 

The present invention also pertains to a method of diagnosing or aiding in the 
diagnosis of a disease or condition associated with a protease gene or gene product 
in an individual. Diagnostic assays can be designed for assessing protease gene 
expression, or for assessing activity of protease polypeptides of the invention. In 

25 one embodiment, the assays are used in the context of a biological sample (e.g. , 
blood, serum, cells, tissue) to thereby determine whether an individual is afflicted 
with a disease or condition associated with a protease, or a defect in a protease. The 
invention also provides for prognostic (or predictive) assays for determining whether 
an individual is susceptible to a disease or condition associated with a protease. For 

30 example, mutations in the gene can be assayed in a biological sample. Such assays 
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can be used for prognostic or predictive purpose to thereby prophylactically treat an 
individual prior to the onset of symptoms associated with a susceptibility to a 
disease or condition associated with a protease. Another aspect of the invention 
pertains to assays for monitoring the influence of agents (e.g., drugs, compounds or 
5 other agents) on the gene expression or activity of polypeptides of the invention, as 
well as to assays for identifying agents which bind to a polypeptides. These and 
other assays and agents are described in further detail in the following sections. 

DIAGNOSTIC ASSAYS 

The nucleic acids, probes, primers, polypeptides and antibodies described 

10 herein can be used in methods of diagnosis of a susceptibility to a disease or 
condition associated with a protease, as well as in kits useful for diagnosis of a 
susceptibility to a disease or condition associated with a protease. 

In one embodiment of the invention, diagnosis of a susceptibility to a disease 
or condition associated with a protease is made by detecting a polymorphism in a 

1 5 protease as described herein. The polymorphism can be a mutation in a protease, 
such as the insertion or deletion of a single nucleotide, or of more than one 
nucleotide, resulting in a frame shift mutation; the change of at least one nucleotide, 
resulting in a change in the encoded amino acid; the change of at least one 
nucleotide, resulting in the generation of a premature stop codon; the deletion of 

20 several nucleotides, resulting in a deletion of one or more amino acids encoded by 
the nucleotides; the insertion of one or several nucleotides, such as by unequal 
recombination or gene conversion, resulting in an interruption of the coding 
sequence of the gene; duplication of all or a part of the gene; transposition of all or a 
part of the gene; or rearrangement of all or a part of the gene. More than one such 

25 mutation may be present in a single gene. Such sequence changes cause a mutation 
in the polypeptide encoded by a protease gene. For example, if the mutation is a 
frame shift mutation, the frame shift can result in a change in the encoded amino 
acids, and/or can result in the generation of a premature stop codon, causing 
generation of a truncated polypeptide. Alternatively, a polymorphism associated 

30 with a susceptibility to a disease or condition associated with a protease can be a 
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synonymous alteration in one or more nucleotides (z.e., an alteration that does not 
result in a change in the polypeptide encoded by a protease gene). Such a 
polymorphism may alter splicing sites, affect the stability or transport of mRNA, or 
otherwise affect the transcription or translation of the gene. A protease gene that has 
5 any of the alterations described above is referred to herein as a "altered gene." 

In a first method of diagnosing a disease or a susceptibility to a disease or 
condition associated with a protease, hybridization methods, such as Southern 
analysis, Northern analysis, or in situ hybridizations, can be used (see Current 
Protocols in Molecular Biology, Ausubel, F. et al, eds., John Wiley & Sons, 

10 including all supplements through 2001). For example, a biological sample from a 
test subject (a "test sample") of genomic DNA, RNA, or cDNA, is obtained from an 
individual suspected of having, being susceptible to or predisposed for, or carrying a 
defect for, a susceptibility to a disease or condition associated with a protease (the 
"test individual"). The individual can be an adult, child, or fetus. The test sample 

15 can be from any source which contains genomic DNA, such as a blood sample, 

sample of amniotic fluid, sample of cerebrospinal fluid, or tissue sample from skin, 
muscle, buccal or conjunctival mucosa, placenta, gastrointestinal tract or other 
organs. A test sample of DNA from fetal cells or tissue can be obtained by 
appropriate methods, such as by amniocentesis or chorionic villus sampling. The 

20 DNA, RNA, or cDNA sample is then examined to determine whether a 

polymorphism in a protease is present, and/or to determine which splicing variant(s) 
encoded by the protease is present. The presence of the polymorphism or splicing 
variant(s) can be indicated by hybridization of the gene in the genomic DNA, RNA, 
or cDNA to a nucleic acid probe. A "nucleic acid probe", as used herein, can be a 

25 DNA probe or an RNA probe; the nucleic acid probe can contain at least one 

polymorphism in a protease or contains a nucleic acid encoding a particular splicing 
variant of a protease. The probe can be any of the nucleic acid molecules described 
above (e.g., the gene, a fragment, a vector comprising the gene, a probe or primer, 
etc.). 

30 To diagnose a disease or a susceptibility to a disease or condition associated 

with a protease, a hybridization sample is formed by contacting the test sample 
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containing a protease, with at least one nucleic acid probe. A preferred probe for 
detecting mRNA or genomic DNA is a labeled nucleic acid probe capable of 
hybridizing to mRNA or genomic DNA sequences described herein. The nucleic 
acid probe can be, for example, a full-length nucleic acid molecule, or a portion 
5 thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides 
in length and sufficient to specifically hybridize under stringent conditions to 
appropriate mRNA or genomic DNA. For example, the nucleic acid probe can be all 
or a portion of one of SEQ ID NOs:l-47, or the complement thereof, or a portion 
thereof; or can be a nucleic acid encoding a portion of one of SEQ ID NOs:48-94. 
10 Other suitable probes for use in the diagnostic assays of the invention are described 
above (see e.g., probes and primers discussed under the heading, "Nucleic Acids of 
the Invention"). 

The hybridization sample is maintained under conditions which are sufficient 
to allow specific hybridization of the nucleic acid probe to a protease. "Specific 

1 5 hybridization", as used herein, indicates exact hybridization {e.g., with no 

mismatches). Specific hybridization can be performed under high stringency 
conditions or moderate stringency conditions, for example, as described above. In a 
particularly preferred embodiment, the hybridization conditions for specific 
hybridization are high stringency. 

20 Specific hybridization, if present, is then detected using standard methods. If 

specific hybridization occurs between the nucleic acid probe and the protease in the 
test sample, then the protease has the polymorphism, or is the splicing variant, that is 
present in the nucleic acid probe. More than one nucleic acid probe can also be used 
concurrently in this method. Specific hybridization of any one of the nucleic acid 

25 probes is indicative of a polymorphism in the protease, or of the presence of a 
particular splicing variant encoding the protease and is therefore diagnostic for a 
disease or a susceptibility to a disease or condition associated with a protease. 

In Northern analysis (see Current Protocols in Molecular Biology, Ausubel, 
F. et aL 9 eds., John Wiley & Sons, supra) the hybridization methods described above 

30 are used to identify the presence of a polymorphism or a particular splicing variant, 
associated with a disease or a susceptibility to a disease or condition associated with 
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a protease. For Northern analysis, a test sample of RNA is obtained from the 
individual by appropriate means. Specific hybridization of a nucleic acid probe, as 
described above, to RNA from the individual is indicative of a polymorphism in a 
protease, or of the presence of a particular splicing variant encoded by a protease, 
5 and is therefore diagnostic for a disease or a susceptibility to a disease or condition 
associated with a protease. 

For representative examples of use of nucleic acid probes, see, for example, 
U.S. Patents No. 5,288,611 and 4,851,330. 

Alternatively, a peptide nucleic acid (PNA) probe can be used instead of a 
1 0 nucleic acid probe in the hybridization methods described above. PNA is a DNA 
mimic having a peptide-like, inorganic backbone, such as N-(2-aminoethyl)glycine 
units, with an organic base (A, G, C, T or U) attached to the glycine nitrogen via a 
methylene carbonyl linker (see,, for example, Nielsen, P.E. et at., Bioconjugate 
Chemistry 5, American Chemical Society, p. 1 (1994). The PNA probe can be 
1 5 designed to specifically hybridize to a gene having a polymorphism associated with 
a disease or a susceptibility to a disease or condition associated with a protease. 
Hybridization of the PNA probe to a protease is diagnostic for a disease or a 
susceptibility to a disease or condition associated with a protease. 

In another method of the invention, mutation analysis by restriction digestion 
20 can be used to detect a mutant gene, or genes containing a polymorphism(s), if the 
mutation or polymorphism in the gene results in the creation or elimination of a 
restriction site. A test sample containing genomic DNA is obtained from the 
individual. Polymerase chain reaction (PCR) can be used to amplify a protease (and, 
if necessary, the fla nk ing sequences) in the test sample of genomic DNA from the 
25 test individual. RFLP analysis is conducted as described (see Current Protocols in 
Molecular Biology, supra). The digestion pattern of the relevant DNA fragment 
indicates the presence or absence of the mutation or polymorphism in the protease, 
and therefore indicates the presence or absence of this disease or a susceptibility to a 
disease or condition associated with a protease. 
30 Sequence analysis can also be used to detect specific polymorphisms in a 

protease. A test sample of DNA or RNA is obtained from the test individual. PCR 
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or other appropriate methods can be used to amplify the gene, and/or its flanking 
sequences, if desired. The sequence of a protease, or a fragment of the gene, or 
cDNA, or fragment of the cDNA, or mRNA, or fragment of the mRNA, is 
determined, using standard methods. The sequence of the gene, gene fragment, 
5 cDNA, cDNA fragment, niRNA, or mRNA fragment is compared with the known 
nucleic acid sequence of the gene, cDNA (e.g., one or more of SEQ ID NOs: 1-47, or 
a complement thereof, or a nucleic acid sequence encoding one of SEQ ID NOs:48- 
94 or a fragment thereof) or mRNA, as appropriate. The presence of a 
polymorphism in the protease indicates that the individual has a disease or a 

10 susceptibility to a disease or condition associated with a protease. 

Allele-specific oligonucleotides can also be used to detect the presence of a 
polymorphism in a protease, through the use of dot-blot hybridization of amplified 
oligonucleotides with allele-specific oligonucleotide (ASO) probes (see, for 
example, Saiki, R. et al, Nature 324:163-166 (1986)). An "allele-specific 

15 oligonucleotide" (also referred to herein as an "allele-specific oligonucleotide 
probe") is an oligonucleotide of approximately 10-50 base pairs, preferably 
approximately 15-30 base pairs, that specifically hybridizes to a protease, and that 
contains a polymorphism associated with a disease or a susceptibility to a disease or 
condition associated with a protease. An allele-specific oligonucleotide probe that is 

20 specific for particular polymorphisms in a protease can be prepared, using standard 
methods (see Current Protocols in Molecular Biology, supra). To identify 
polymorphisms in the gene that are associated with a disease or a susceptibility to a 
disease or condition associated with a protease, a test sample of DNA is obtained 
from the individual. PCR can be used to amplify all or a fragment of a protease, and 

25 its flanking sequences. The DNA containing the amplified protease (or fragment of 
the gene) is dot-blotted, using standard methods (see Current Protocols in 
Molecular Biology, supra), and the blot is contacted with the oligonucleotide probe. 
The presence of specific hybridization of the probe to the amplified protease is then 
detected. Specific hybridization of an allele-specific oligonucleotide probe to DNA 

30 from the individual is indicative of a polymorphism in the protease, and is therefore 
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indicative of a disease or a susceptibility to a disease or condition associated with a 
protease. 

An allele-specific primer hybridizes to a site on target DNA overlapping a 
polymorphism and only primes amplification of an allelic form to which the primer 
5 exhibits perfect complementarity. See Gibbs, Nucleic Acid Res. 17, 2427-2448 
(1989). This primer is used in conjunction with a second primer, which hybridizes 
at a distal site. Amplification proceeds from the two primers, resulting in a 
detectable product, which indicates the particular allelic form is present. A control 
is usually performed with a second pair of primers, one of which shows a single base 

1 0 mismatch at the polymorphic site and the other of which exhibits perfect 

complementarity to a distal site. The single-base mismatch prevents amplification 
and no detectable product is formed. The method works best when the mismatch is 
included in the 3 -most position of the oligonucleotide aligned with the 
polymorphism because this position is most destabilizing to elongation from the 

1 5 primer (see, e.g. , WO 93/22456). 

In another embodiment, arrays of oligonucleotide probes that are 
complementary to target nucleic acid sequence segments from an individual, can be 
used to identify polymorphisms in a protease. For example, in one embodiment, an 
oligonucleotide array can be used. Oligonucleotide arrays typically comprise a 

20 plurality of different oligonucleotide probes that are coupled to a surface of a 

substrate in different known locations. These oligonucleotide arrays, also described 
as "Genechips™," have been generally described in the art, for example, U.S. Pat. 
No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and 92/10092. These 
arrays can generally be produced using mechanical synthesis methods or light 

25 directed synthesis methods which incorporate a combination of photolithographic 
methods and solid phase oligonucleotide synthesis methods. See Fodor et al 9 
Science 251:767-777 (1991), Pirrung et al, U.S. Pat. No. 5,143,854 (see also PCT 
Application No. WO 90/15070) and Fodor et al y PCT Publication No. WO 
92/10092 and U.S. Pat. No. 5,424,186, the entire teachings of each of which are 

30 incorporated by reference herein. Techniques for the synthesis of these arrays using 
mechanical synthesis methods are described in, e.g., U.S. Pat. Nos. 5,384,261, the 
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entire teachings of which are incorporated by reference herein. In another example, 
linear arrays can be utilized. 

Once an oligonucleotide array is prepared, a nucleic acid of interest is 
hybridized with the array and scanned for polymorphisms. Hybridization and 
5 scanning are generally carried out by methods described herein and also in, e.g., 
published PCT Application Nos. WO 92/10092 and WO 95/1 1995, and U.S. Pat. 
No. 5,424,186, the entire teachings of which are incorporated by reference herein. In 
brief, a target nucleic acid sequence which includes one or more previously 
identified polymorphic markers is amplified by well known amplification 

1 0 techniques, e.g. , PCR. Typically, this involves the use of primer sequences that are 
complementary to the two strands of the target sequence both upstream and 
downstream from the polymorphism. Asymmetric PCR techniques may also be 
used. Amplified target, generally incorporating a label, is then hybridized with the 
array under appropriate conditions. Upon completion of hybridization and washing 

15 of the array, the array is scanned to determine the position on the array to which the 
target sequence hybridizes. The hybridization data obtained from the scan is 
typically in the form of fluorescence intensities as a function of location on the array. 

Although primarily described in terms of a single detection block, e.g., for 
detection of a single polymorphism, arrays can include multiple detection blocks, 

20 and thus be capable of analyzing multiple, specific polymorphisms. In alternate 
arrangements, it will generally be understood that detection blocks may be grouped 
within a single array or in multiple, separate arrays so that varying, optimal 
conditions may be used during the hybridization of the target to the array, For 
example, it may often be desirable to provide for the detection of those 

25 polymorphisms that fall within G-C rich stretches of a genomic sequence, separately 
from those falling in A-T rich segments. This allows for the separate optimization 
of hybridization conditions for each situation. 

Additional description of use of oligonucleotide arrays for detection of 
polymorphisms can be found, for example, in U.S. Patents 5,858,659 and 5,837,832, 

30 the entire teachings of which are incorporated by reference herein. 

Other methods of nucleic acid analysis can be used to detect polymorphisms 
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in a protease or variants encoding by a protease. Representative methods include 
direct manual sequencing (Church and Gilbert, Proc. Natl Acad, Sci. USA 81:1991- 
1995 (1988); Sanger, F. et al, Proc. Natl. Acad. Sci. USA 74:5463-5467 (1977); 
Beavis et al. U.S. Pat. No. 5,288,644); automated fluorescent sequencing; single- 

5 stranded conformation polymorphism assays (SSCP); clamped denaturing gel 

electrophoresis (CDGE); denaturing gradient gel electrophoresis (DGGE) (Sheffield, 
V.C. et al, Proc. Natl Acad. Sci. USA 86:232-236 (1989)), mobility shift analysis 
(Orita, M. et al, Proc. Natl Acad. Sci. USA 86:2766-2770 (1989)), restriction 
enzyme analysis (Flavell etal, Cell 15:25 (1978); Geever, et al, Proc. Natl Acad. 

1 0 Sci. USA 78:5081 (1981)); heteroduplex analysis; chemical mismatch cleavage 
(CMC) (Cotton et al, Proc. Natl. Acad. Sci. USA 85:4397-4401 (1985)); RNase 
protection assays (Myers, R.M. et al, Science 230:1242 (1985)); use of polypeptides 
which recognize nucleotide mismatches, such as E. coli mutS protein; allele-speciiBc 
PCR, for example. 

15 In one embodiment of the invention, diagnosis of a disease or condition 

associated with a nucleic acid or a susceptibility to a disease or condition associated 
with a nucleic acid can also be made by expression analysis by quantitative PCR 
(kinetic thermal cycling). This technique utilizing TaqMan® can be used to allow 
the identification of polymorphisms and whether a patient is homozygous or 

20 heterozygous. The technique can assess the presence of an alteration in the 

expression or composition of the polypeptide encoded by the nucleic acid or splicing 
variants encoded by the nucleic acid. Further, the expression of the variants can be 
quantified as physically or functionally different. 

In another embodiment of the invention, diagnosis of a disease or a 

25 susceptibility to a disease or condition associated with a protease can also be made 
by examining expression and/or composition of a protease polypeptide, by a variety 
of methods, including enzyme linked immunosorbent assays (ELISAs), Western 
blots, immunoprecipitations and immunofluorescence. A test sample from an 
individual is assessed for the presence of an alteration in the expression and/or an 

30 alteration in composition of the polypeptide encoded by a protease, or for the 

presence of a particular variant encoded by a protease. An alteration in expression 
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of a polypeptide encoded by a protease can be, for example, an alteration in the 
quantitative polypeptide expression {i.e., the amount of polypeptide produced); an 
alteration in the composition of a polypeptide encoded by a protease is an alteration 
in the qualitative polypeptide expression (e.g., expression of a mutant protease 
5 polypeptide or of a different splicing variant). In a preferred embodiment, diagnosis 
of a disease or a susceptibility to a disease or condition associated with a protease is 
made by detecting a particular splicing variant encoded by that protease, or a 
particular pattern of splicing variants. 

Both such alterations (quantitative and qualitative) can also be present. An 

10 "alteration" in the polypeptide expression or composition, as used herein, refers to 
an alteration in expression or composition in a test sample, as compared with the 
expression or composition of polypeptide by a protease in a control sample. A 
control sample is a sample that corresponds to the test sample (e.g., is from the same 
type of cells), and is from an individual who is not affected by a susceptibility to a 

1 5 disease or condition associated with a protease. An alteration in the expression or 
composition of the polypeptide in the test sample, as compared with the control 
sample, is indicative of a disease or a susceptibility to a disease or condition 
associated with a protease. Similarly, the presence of one or more different splicing 
variants in the test sample, or the presence of significantly different amounts of 

20 different splicing variants in the test sample, as compared with the control sample, is 
indicative of a disease or a susceptibility to a disease or condition associated with a 
protease. Various means of examining expression or composition of the polypeptide 
encoded by a protease can be used, including spectroscopy, colorimetry, 
electrophoresis, isoelectric focusing, and immunoassays (e.g., David et al, U.S. Pat. 

25 No. 4,376,1 10) such as immunoblotting (see also Current Protocols in Molecular 
Biology, particularly Chapter 10). For example, in one embodiment, an antibody 
capable of binding to the polypeptide (e.g., as described above), preferably an 
antibody with a detectable label, can be used. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or 

30 F(ab , ) 2 ) can be used. The term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or antibody by coupling (ie, 
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physically linking) a detectable substance to the probe or antibody, as well as 
indirect labeling of the probe or antibody by reactivity with another reagent that is 
directly labeled. Examples of indirect labeling include detection of a primary 
antibody using a fluorescently labeled secondary antibody and end-labeling of a 
5 DNA probe with biotin such that it can be detected with fluorescently labeled 
streptavidin. 

Western blotting analysis, using an antibody as described above that 
specifically binds to a polypeptide encoded by a mutant protease, or an antibody that 
specifically binds to a polypeptide encoded by a non-mutant gene, or an antibody 

1 0 that specifically binds to a particular splicing variant encoded by a protease, can be 
used to identify the presence in a test sample of a particular splicing variant or of a 
polypeptide encoded by a polymorphic or mutant protease, or the absence hi a test 
sample of a particular splicing variant or of a polypeptide encoded by a non- 
polymorphic or non-mutant gene. The presence of a polypeptide encoded by a 

1 5 polymorphic or mutant gene, or the absence of a polypeptide encoded by a non- 
polymorphic or non-mutant gene, is diagnostic for a disease or a susceptibility to a 
disease or condition associated with a protease, as is the presence (or absence) of 
particular splicing variants encoded by the protease gene. 

In one embodiment of this method, the level or amount of polypeptide 

20 encoded by a protease in a test sample is compared with the level or amount of the 
polypeptide encoded by the protease in a control sample. A level or amount of the 
polypeptide in the test sample that is higher or lower than the level or amount of the 
polypeptide in the control sample, such that the difference is statistically significant, 
is indicative of an alteration in the expression of the polypeptide encoded by the 

25 protease, and is diagnostic for a disease or a susceptibility to a disease or condition 
associated with that protease. Alternatively, the composition of the polypeptide 
encoded by a protease in a test sample is compared with the composition of the 
polypeptide encoded by the protease in a control sample (e.g., the presence of 
different splicing variants). A difference in the composition of the polypeptide in 

30 the test sample, as compared with the composition of the polypeptide in the control 
sample, is diagnostic for a disease or a susceptibility to a disease or condition 
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associated with that protease. In another embodiment, both the level or amount and 
the composition of the polypeptide can be assessed in the test sample and in the 
control sample. A difference in the amount or level of the polypeptide in the test 
sample, compared to the control sample; a difference in composition in die test 
5 sample, compared to the control sample; or both a difference in the amount or level, 
and a difference in the composition, is indicative of a disease or a susceptibility to a 
disease or condition associated with that protease. 

The invention further pertains to a method for the diagnosis and 
identification of susceptibility to disease in an individual, by identifying an at-risk 

10 haplotype in a protease gene. In one embodiment, the at-risk haplotype is one which 
confers a significant risk of a disease associated with a protease, hi one 
embodiment, significance associated with a haplotype is measured by an odds ratio. 
In a further embodiment, the significance is measured by a percentage. In one 
embodiment, a significant risk is measured as an odds ratio of at least about 2.2, 

15 including by not limited to: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, and 1.9. Inafurther 

embodiment, an odds ratio of at least 1.2 is significant. In a further embodiment, an 
odds ratio of at least about 1.5 is significant. In a further embodiment, a significant 
increase in risk is at least about 1.7 is significant. In a further embodiment, a 
significant increase in risk is at least about 20%, including but not limited to about 

20 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 
95%, and 98%. In a further embodiment, a significant increase in risk is at least 
about 50%. It is understood however, that identifying whether a risk is medically 
significant may also depend on a variety of factors, including the specific disease, 
the haplotype, and often, environmental factors. 

25 The invention also pertains to methods of diagnosing a disease or a 

susceptibility to a disease associated with a protease in an individual, comprising 
screening for an at-risk haplotype in the nucleic acid that is more frequently present 
in an individual susceptible to a protease associated disease (affected), compared to 
the frequency of its presence in a healthy individual (control), wherein the presence 

30 of the haplotype is indicative of a disease or a susceptibility to a protease associated 
disease. Standard techniques for genotyping for the presence of SNPs and/or 
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raicrosatellite markers that are associated with a protease gene can be used, such as 
fluorescent based techniques (Chen, et al., Genome Res. 9, 492 (1999)), PCR, LCR, 
Nested PCR and other techniques for nucleic acid amplification. In a certain 
embodiment, the method comprises assessing in an individual the presence or 
5 frequency of SNPs and/or microsatellites in the nucleic acid that are associated with 
a protease associated disease, wherein an excess or higher frequency of the SNPs 
and/or microsatellites compared to a healthy control individual is indicative that the 
individual has a protease associated disease or is susceptible to a protease associated 
disease. 

10 Kits (e.g., reagent kits) useful in the methods of diagnosis comprise 

components useful in any of the methods described herein, including for example, 
hybridization probes or primers as decribed herein (e.g., labeled probes or primers), 
reagents for detection of labeled molecules, restriction enzymes {e.g., for RPLP 
analysis), allele-specific oligonucleotides, antibodies which bind to altered or to non- 

15 altered (native) protease polypeptide, means for amplification of nucleic acids 
comprising a protease, or means for analyzing the nucleic acid sequence of a 
protease or for analyzing the amino acid sequence of a protease polypeptide, etc. 

In one embodiment, a kit for diagnosing a protease associated disease or 
susceptibility to a protease associated disease can comprise primers for nucleic acid 

20 amplification of a region in a nucleic acid comprising an at-risk haplotype that, is 
more frequently present in an individual having a protease associated disease or is 
susceptible to a protease associated disease. The primers can be designed using 
protion of the nucleic acids flanking SNPs that are indicative of a protease 
associated disease. 



25 SCREENING ASSAYS AND AGENTS IDENTIFIED THEREBY 

The invention provides methods (also referred to herein as "screening 
assays") for identifying the presence of a nucleotide that hybridizes to a nucleic acid 
of the invention, as well as for identifying the presence of a polypeptide encoded by 
a nucleic acid of the invention. In one embodiment, the presence (or absence) of a 

30 nucleic acid molecule of interest (e.g. 9 a nucleic acid that has significant homology 
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with a nucleic acid of the invention) in a sample can be assessed by contacting the 
sample with a nucleic acid comprising a nucleic acid of the invention (e.g., a nucleic 
acid having the sequence of one of SEQ ID NOs:l-47, or the complement thereof, or 
a nucleic acid encoding an amino acid having the sequence of one of SEQ ED 
5 NOs:48-94, or a fragment or variant of such nucleic acids), under stringent 

conditions as described above, and then assessing the sample for the presence (or 
absence) of hybridization. In a preferred embodiment, high stringency conditions 
are conditions appropriate for selective hybridization. In another embodiment, a 
sample containing the nucleic acid molecule of interest is contacted with a nucleic 
1 0 acid containing a contiguous nucleotide sequence {e.g., a primer or a probe as 

described above) that is at least partially complementary to a part of the nucleic acid 
molecule of interest {e.g., a protease nucleic acid), and the contacted sample is 
assessed for the presence or absence of hybridization. In a preferred embodiment, 
the nucleic acid containing a contiguous nucleotide sequence is completely 
1 5 complementary to a part of the nucleic acid molecule of interest. 

In any of these embodiments, all or a portion of the nucleic acid of interest 
can be subjected to amplification prior to performing the hybridization. 

In another embodiment, the presence (or absence) of a polypeptide of 
interest, such as a polypeptide of the invention or a fragment or variant thereof, in a 
20 sample can be assessed by contacting the sample with an antibody that specifically 
hybridizes to the polypeptide of interest {e.g., an antibody such as those described 
above), and then assessing the sample for the presence (or absence) of binding of the 
antibody to the polypeptide of interest. 

hi another embodiment, the invention provides methods for identifying 
25 agents {e.g., fusion proteins, polypeptides, peptidomimetics, prodrugs, receptors, 

binding agents, antibodies, small molecules or other drugs, or ribozymes which alter 
{e.g., increase or decrease) the activity of the polypeptides described herein, or 
which otherwise interact with the polypeptides herein. For example, such agents can 
be agents which bind to polypeptides described herein {e.g., protease binding 
30 agents); which have a stimulatory or inhibitory effect on, for example, activity of 
polypeptides of the invention; or which change {e.g., enhance or inhibit) the ability 
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of the polypeptides of the invention to interact with protease binding agents (e.g., 
receptors or other binding agents); or which alter posttranslational processing of the 
protease polypeptide (e.g., agents that alter proteolytic processing to direct the 
polypeptide from where it is normally synthesized to another location in the cell, 
5 such as the cell surface; agents that alter proteolytic processing such that more 
polypeptide is released from the cell, etc. 

In one embodiment, the invention provides assays for screening candidate or 
test agents that bind to or modulate the activity of polypeptides described herein (or 
biologically active portion(s) thereof), as well as agents identifiable by the assays. 

10 Test agents can be obtained using any of the numerous approaches in . combinatorial 
library methods known in the art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the 'one-bead one-compound' library method; and 
synthetic library methods using affinity chromatography selection. The biological 

1 5 library approach is limited to polypeptide libraries, while the other four approaches 
are applicable to polypeptide, non-peptide oligomer or small molecule libraries of 
compounds (Lam, K.S., Anticancer Drug Des. 12:145 (1997)). 

In one embodiment, to identify agents which alter the activity of a protease 
polypeptide, a cell, cell lysate, or solution containing or expressing a protease 

20 polypeptide (e.g. , one of SEQ ID NOs:48-94, or another splicing variant encoded by 
a protease), or a fragment or derivative thereof (as described above), can be 
contacted with an agent to be tested; alternatively, the polypeptide can be contacted 
directly with the agent to be tested. The level (amount) of protease activity is 
assessed (e.g., the level (amount) of protease activity is measured, either directly or 

25 indirectly), and is compared with the level of activity in a control (i.e., the level of 
activity of the protease polypeptide or active fragment or derivative thereof in the 
absence of the agent to be tested). If the level of the activity in the presence of the 
agent differs, by an amount that is statistically significant, from the level of the 
activity in the absence of the agent, then the agent is an agent that alters the activity 

30 of a protease polypeptide. An increase in the level of protease activity relative to a 
control, indicates that the agent is an agent that enhances (is an agonist of) protease 
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activity. Similarly, a decrease in the level of protease activity relative to a control, 
indicates that the agent is an agent that inhibits (is an antagonist of) protease activity. 
In another embodiment, the level of activity of a protease polypeptide or derivative 
or fragment thereof in the presence of the agent to be tested, is compared with a 
5 control level that has previously been established. A level of the activity in the 
presence of the agent that differs from the control level by an amount that is 
statistically significant indicates that the agent alters protease activity. 

The present invention also relates to an assay for identifying agents which 
alter the expression of a protease gene (e.g., antisense nucleic acids, fusion proteins, 

1 0 polypeptides, peptidomimetics, prodrugs, receptors, binding agents, antibodies, 

small molecules or other drugs, or ribozymes) which alter (e.g., increase or decrease) 
expression (e.g., transcription or translation) of the gene or which otherwise interact 
with the nucleic acids described herein, as well as agents identifiable by the assays. 
For example, a solution containing a nucleic acid encoding a protease polypeptide 

1 5 (e.g., a protease gene) can be contacted with an agent to be tested. The solution can 
comprise, for example, cells containing the nucleic acid or cell lysate containing the 
nucleic acid; alternatively, the solution can be another solution which comprises 
elements necessary for transcription/translation of the nucleic acid. Cells not 
suspended in solution can also be employed, if desired. The level and/or pattern of 

20 protease expression (e.g., the level and/or pattern of mRNA or of protein expressed, 
such as the level and/or pattern of different splicing variants) is assessed, and is 
compared with the level and/or pattern of expression in a control (i.e., the level 
and/or pattern of the protease expression in the absence of the agent to be tested). If 
the level and/or partem in the presence of the agent differs, by an amount or in a 

25 manner that is statistically significant, from the level and/or pattern in the absence of 
the agent, then the agent is an agent that alters the expression of a protease. 
Enhancement of protease expression indicates that the agent is an agonist of protease 
activity. Similarly, inhibition of protease expression indicates that the agent is an 
antagonist of protease activity. In another embodiment, the level and/or pattern of 

30 protease polypeptide(s) (e.g. , different splicing variants) in the presence of the agent 
to be tested, is compared with a control level and/or pattern that has previously been 
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established. A level and/or pattern in the presence of the agent that differs from the 
• control level and/or pattern by an amount or in a manner that is statistically 
significant indicates that the agent alters protease expression. Such methods can be 
used to identify compounds that inhibit proteases involved in infection by pathogens. 
5 In another embodiment of the invention, agents which alter the expression of 

a protease gene or which otherwise interact with the nucleic acids described herein, 
can be identified using a cell, cell lysate, or solution containing a nucleic acid 
encoding the promoter region of the protease gene operably linked to a reporter 
gene. After contact with an agent to be tested, the level of expression of the reporter 

10 gene (e.g., the level of mRNA or of protein expressed) is assessed, and is compared 
with the level of expression in a control (i.e., the level of the expression of the 
reporter gene in the absence of the agent to be tested). If the level in the presence of 
the agent differs, by an amount or in a manner that is statistically significant, from 
the level in the absence of the agent, then the agent is an agent that alters the 

15 expression of the protease, as indicated by its ability to alter expression of a gene 
. that is operably linked to the protease gene promoter. Enhancement of the 

expression of the reporter indicates that the agent is an agonist of protease activity. 
Similarly, inhibition of the expression of the reporter indicates that the agent is an 
antagonist of protease activity. In another embodiment, the level of expression of 

20 the reporter in the presence of the agent to be tested, is compared with a control level 
that has previously been established. A level in the presence of the agent that differs 
from the control level by an amount or in a manner that is statistically significant 
indicates that the agent alters expression. 

Agents which alter the amounts of different splicing variants encoded by a 

25 protease (e.g., an agent which enhances activity of a first splicing variant, and which 
inhibits activity of a second splicing variant), as well as agents which are agonists of 
activity of a first splicing variant and antagonists of activity of a second splicing 
variant, can easily be identified using these methods described above. 

In other embodiments of the invention, assays can be used to assess the 

30 impact of a test agent on the activity of a polypeptide in relation to a protease 

binding agent. For example, a cell that expresses a compound that interacts with a 
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protease (herein referred to as a "protease binding agent", which can be a 
polypeptide or other molecule that interacts with a protease, such as a receptor) is 
contacted with a protease in the presence of a test agent, and the ability of the test 
agent to alter the interaction between the protease and the protease binding agent is 

5 determined. Alternatively, a cell lysate or a solution containing the protease binding 
agent, can be used. An agent which binds to the protease or the protease binding 
agent can alter the interaction by interfering with, or enhancing the ability of the 
protease to bind to, associate with, or otherwise interact with the protease binding 
agent. Determining the ability of the test agent to bind to a protease or a protease 

10 binding agent can be accomplished, for example, by coupling the test agent with a 
radioisotope or enzymatic label such that binding of the test agent to the polypeptide 
can be determined by detecting the labeled with n \ 35 S, 14 C or 3 H, either directly or 
indirectly, and the radioisotope detected by direct counting of radioemmission or by 
scintillation counting. Alternatively, test agents can be enzymatically labeled with, 

15 for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the 

enzymatic label detected by determination of conversion of an appropriate substrate 
to product. It is also within the scope of this invention to determine the ability of a 
test agent to interact with the polypeptide without the labeling of any of the 
interactants. For example, a microphysiometer can be used to detect the interaction 

20 of a test agent with a protease or a protease binding agent without the labeling of 
either the test agent, protease, or the protease binding agent. McConnell, H.M. et 
aL, Science 257:1906-1912 (1992). As used herein, a "microphysiometer" (e.g., 
Cytosensor™) is an analytical instrument that measures the rate at which a cell 
acidifies its environment using a light-addressable potentiometric sensor (LAPS). 

25 Changes in this acidification rate can be used as an indicator of the interaction 
between ligand and polypeptide. Thus, these receptors can be used to screen for 
compounds that are agonists for use in treating a susceptibility to a disease or 
condition associated with a protease or antagonists for studying a susceptibility to a 
disease or condition associated with a protease. Drugs could be designed to regulate 

30 protease activation which in turn can be used to regulate signaling pathways and 
transcription events of genes downstream. 
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In another embodiment of the invention, assays can be used to identify 
polypeptides that interact with one or more protease polypeptides, as described 
herein. For example, a yeast two-hybrid system such as that described by Fields and 
Song (Fields, S. and Song, O., Nature 340:245-246 (1989)) can be used to identify 
5 polypeptides that interact with one or more protease polypeptides. In such a yeast 
two-hybrid system, vectors are cpnstructed based on the flexibility of a transcription 
factor which has two functional domains (a DNA binding domain and a transcription 
activation domain). If the two domains are separated but fused to two different 
proteins that interact with one another, transcriptional activation can be achieved,., 

10 and transcription of specific markers (e.g., nutritional markers such as His and Ade, 
or color markers such as lacZ) can be used to identify the presence of interaction and 
transcriptional activation. For example, in the methods of the invention, a first 
vector is used which includes a nucleic acid encoding a DNA binding domain and 
also a protease polypeptide, splicing variant, or fragment or derivative thereof, and a 

15 second vector is used which includes a nucleic acid encoding a transcription 

activation domain and also a nucleic acid encoding a polypeptide which potentially 
may interact with the protease polypeptide, splicing variant, or fragment or 
derivative thereof (e.g., a protease polypeptide binding agent or receptor). 
Incubation of yeast containing the first vector and the second vector under 

20 appropriate conditions (e.g., mating conditions such as used in the Matchmaker™ 

system from Clontech (Palo Alto, California, USA)) allows identification of colonies 
which express the markers of interest. These colonies can be examined to identify 
the polypeptide(s) which interact with the protease polypeptide or fragment or 
derivative thereof. Such polypeptides may be useful as agents which alter the 

25 activity of expression of a protease polypeptide, as described above. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either the protease, the protease binding 
agent, or other components of the assay on a solid support, in order to facilitate 
separation of complexed from uncomplexed forms of one or both of the 

30 polypeptides, as well as to accommodate automation of the assay. Binding of a test 
agent to the polypeptide, or interaction of the polypeptide with a binding agent in the 
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presence and absence of a test agent, can be accomplished in any vessel suitable for 
containing the reactants. Examples of such vessels include microtitre plates, test 
tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein (e.g., a 
glutathione-S -transferase fusion protein) can be provided which adds a domain that 
5 allows a protease or a protease binding agent to be bound to a matrix or other solid 
support. 

In another embodiment, modulators of expression of nucleic acid molecules 
of the invention are identified in a method wherein a cell, cell lysate, or solution 
containing a nucleic acid encoding a protease is contacted with a test agent and the 

10 expression of appropriate mRNA or polypeptide (e.g., splicing variant(s)) in the cell, 
cell lysate, or solution, is determined. The level of expression of appropriate mRNA 
or polypeptide(s) in the presence of the test agent is compared to the level of 
expression of mRNA or polypeptide(s) in the absence of the test agent. The test 
agent can then be identified as a modulator of expression based on this comparison. 

1 5 For example, when expression of mRNA or polypeptide is greater (statistically 
significantly greater) in the presence of the test agent than in its absence, the test 
agent is identified as a stimulator or enhancer of the mRNA or polypeptide 
expression. Alternatively, when expression of the mRNA or polypeptide is less 
(statistically significantly less) in the presence of the test agent than in its absence, 

20 the test agent is identified as an inhibitor of the mRNA or polypeptide expression. 
The level of mRNA or polypeptide expression in the cells can be determined by 
methods described herein for detecting mRNA or polypeptide. 

This invention further pertains to novel agents identified by the 
above-described screening assays. Accordingly, it is within the scope of this 

25 invention to further use an agent identified as described herein in an appropriate 
animal model. For example, an agent identified as described herein (e.g., a test 
agent that is a modulating agent, an antisense nucleic acid molecule, a specific 
antibody, or a polypeptide-binding agent) can be used in an animal model to 
determine the efficacy, toxicity, or side effects of treatment with such an agent. 

30 Alternatively, an agent identified as described herein can be used in an animal model 
to determine the mechanism of action of such an agent. Furthermore, this invention 



WO 03/040393 



PCT/IB02/04615 



-52- 

pertains to uses of novel agents identified by the above-described screening assays 
for treatments as described herein. In addition, an agent identified as described 
herein can be used to alter activity of a polypeptide encoded by a protease, or to alter 
expression of a protease, by contacting the polypeptide or the gene (or contacting a 
5 cell comprising the polypeptide or the gene) with the agent identified as described 
herein. 

PHARMACEUTICAL COMPOSITIONS 

The present invention also pertains to pharmaceutical compositions 
comprising nucleic acids described herein, particularly nucleotides encoding the 

1 0 polypeptides described herein; comprising polypeptides described herein (e.g. , one 
or more of SEQ ID NOs:48-94); and/or comprising other splicing variants encoded 
by a protease; and/or an agent that alters (e.g., enhances or inhibits) protease gene 
expression or protease polypeptide activity as described herein. For instance, a 
polypeptide, protein (e.g., a protease receptor), an agent that alters protease gene 

1 5 expression, or a protease binding agent or binding partner, fragment, fusion protein 
or prodrug thereof, or a nucleotide or nucleic acid construct (vector) comprising a 
nucleotide of the present invention, or an agent that alters protease polypeptide 
activity, can be formulated with a physiologically acceptable carrier or excipient to 
prepare a pharmaceutical composition. The carrier and composition can be sterile. 

20 The formulation should suit the mode of administration. 

Suitable pharmaceutically acceptable carriers include but are not limited to 
water, salt solutions (e.g., NaCl), saline, buffered saline, alcohols, glycerol, ethanol, 
gum arabic, vegetable oils, benzyl alcohols, polyethylene glycols, gelatin, 
carbohydrates such as lactose, amylose or starch, dextrose, magnesium stearate, talc, 

25 silicic acid, viscous paraffin, perfume oil, fatty acid esters, hydroxymethylcellulose, 
polyvinyl pyrolidone, etc., as well as combinations thereof. The pharmaceutical 
preparations can, if desired, be mixed with auxiliary agents, e.g., lubricants, 
preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic 
pressure, buffers, coloring, flavoring and/or aromatic substances and the like which 

30 do not deleteriously react with the active agents. 
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The composition, if desired, can also contain minor amounts of wetting or 
emulsifying agents, or pH buffering agents. The composition can be a liquid 
solution, suspension, emulsion, tablet, pill, capsule, sustained release formulation, or 
powder. The composition can be formulated as a suppository, with traditional 
5 binders and carriers such as triglycerides. Oral formulation can include standard 
carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium 
stearate, polyvinyl pyrollidone, sodium saccharine, cellulose, magnesium carbonate, 
etc. 

Methods of introduction of these compositions include, but are not limited 

10 to, intradermal, intramuscular, intraperitoneal, intraocular, intravenous, 

subcutaneous, topical, oral and intranasal. Other suitable methods of introduction 
can also include gene therapy (as described below), rechargeable or biodegradable 
devices, particle acceleration devises ("gene guns") and slow release polymeric 
devices. The pharmaceutical compositions of this invention can also be 

1 5 administered as part of a combinatorial therapy with other agents. 

The composition can be formulated in accordance with the routine 
procedures as a pharmaceutical composition adapted for administration to human 
beings. For example, compositions for intravenous administration typically are 
solutions in sterile isotonic aqueous buffer. Where necessary, the composition may 

20 also include a solubilizing agent and a local anesthetic to ease pain at the site of the 
injection. Generally, the ingredients are supplied either separately or mixed together 
in unit dosage form, for example, as a dry lyophilized powder or water free 
concentrate in a hermetically sealed container such as an ampule or sachette 
indicating the quantity of active agent. Where the composition is to be administered 

25 by infusion, it can be dispensed with an infusion bottle containing sterile 

pharmaceutical grade water, saline or dextrose/water. Where the composition is 
administered by injection, an ampule of sterile water for injection or saline can be 
provided so that the ingredients maybe mixed prior to administration. 

For topical application, nonsprayable forms, viscous to semi-solid or solid 

30 forms comprising a carrier compatible with topical application and having a dynamic 
viscosity preferably greater than water, can be employed. Suitable formulations 
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include but are not limited to solutions, suspensions, emulsions, creams, ointments, 
powders, enemas, lotions, sols, liniments, salves, 'aerosols, etc., which are, if desired, 
sterilized or mixed with auxiliary agents, e.g., preservatives, stabilizers, wetting 
agents, buffers or salts for influencing osmotic pressure, etc. The agent may be 
5 incorporated into a cosmetic formulation. For topical application, also suitable are 
sprayable aerosol preparations wherein the active ingredient, preferably in 
combination with a solid or liquid inert carrier material, is packaged in a squeeze 
bottle or in admixture with a pressurized volatile, normally gaseous propellant, e.g., 
pressurized air. 

10 Agents described herein can be formulated as neutral or salt forms. 

Pharmaceutically acceptable salts include those formed with free amino groups such 
as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., 
and those formed with free carboxyl groups such as those derived from sodium, 
potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine, 2- 

15 ethylamino ethanol, histidine, procaine, etc. 

The agents are administered in a therapeutically effective amount. The 
amount of agents which will be therapeutically effective in the treatment of a 
particular disorder or condition will depend on the nature of the disorder or 
condition, and can be determined by standard clinical techniques. In addition, in 

20 vitro or in vivo assays may optionally be employed to help identify optimal dosage 
ranges. The precise dose to be employed in the formulation will also depend on the 
route of a(urrinistration, and the seriousness of the symptoms of a susceptibility to a 
disease or condition associated with a protease, and should be decided according to 
the judgment of a practitioner and each patient's circumstances. Effective doses 

25 maybe extrapolated from dose-response curves derived from in vitro or animal 
model test systems. 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Optionally associated with such container(s) can be a 

30 notice in the form prescribed by a governmental agency regulating the manufacture, 
use or sale of pharmaceuticals or biological products, which notice reflects approval 
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by the agency of manufacture, use of sale for human administration. The pack or kit 
can be labeled with information regarding mode of administration, sequence of drug 
administration {e.g. , separately, sequentially or concurrently), or the like. The pack 
or kit may also include means for reminding the patient to take the therapy. The 

5 pack or kit can be a single unit dosage of the combination therapy or it can be a 
plurality of unit dosages. In particular, the agents can be separated, mixed together 
in any combination, present in a single vial or tablet. Agents assembled in a blister 
pack or other dispensing means is preferred. For the purpose of this invention, unit 
dosage is intended to mean a dosage that is dependent on the individual 

10 pharmacodynamics of each agent and administered in FDA approved dosages in 
standard time courses. 

METHODS OF THERAPY 

The present invention also pertains to methods of treatment (prophylactic 
and/or therapeutic) for a disease or condition associated with a protease or 

1 5 susceptibility to a disease or condition associated with a protease, using a protease 
therapeutic agent. A "protease therapeutic agent" is an agent that alters {e.g., 
enhances or inhibits) protease polypeptide activity and/or protease gene expression, 
as described herein {e.g., a protease agonist or antagonist). Protease therapeutic 
agents can alter protease polypeptide activity or gene expression by a variety of 

20 means, such as, for example, by providing additional protease polypeptide or by 
upregulating the transcription or translation of the protease gene; by altering 
posttranslational processing of the protease polypeptide; by altering transcription of 
protease splicing variants; or by interfering with protease polypeptide activity {e.g., 
by binding to a protease polypeptide), or by downregulating the transcription or 

25 translation of a protease gene. Representative protease therapeutic agents include 
the following: 

nucleic acids or fragments or derivatives thereof described herein, 
particularly nucleotides encoding the polypeptides described herein and vectors 
comprising such nucleic acids {e.g, a gene, cDNA, and/or mRNA, such as a nucleic 
30 acid encoding a protease polypeptide or active fragment or derivative thereof, or an 
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oligonucleotide; for example, one of SEQ 3D NOs:l-47, or a complement thereof, or 
a nucleic acid encoding one of SEQ ID NOs:48-94, or fragments or derivatives 
thereof); 

polypeptides described herein (e.g., one pr more of SEQ ID NOs:48~94, 
5 and/or other splicing variants encoded by a protease, or fragments or derivatives 
thereof); 

other polypeptides (e.g., protease receptors); protease binding agents; 
peptidomimetics; fusion proteins or prodrugs thereof; antibodies (e.g., an antibody to 
a mutant protease polypeptide, or an antibody to a non-mutant protease polypeptide, 
10 or an antibody to a particular splicing variant encoded by a protease, as described 
above); ribozymes; other small molecules; and 

other agents that alter (e.g., enhance or inhibit) protease gene expression or 
polypeptide activity, or that regulate transcription of protease splicing variants (e.g., 
agents that affect which splicing variants are expressed, or that affect the amount of 
1 5 each splicing, variant that is expressed. 

More than one protease therapeutic agent can be used concurrently, if 
desired. 

A protease therapeutic agent that is a nucleic acid is used in the treatment of 
a susceptibility to a disease or condition associated with a protease. The term, 

20 "treatment" as used herein, refers not only to ameliorating symptoms associated with 
the disease, but also preventing or delaying the onset of the disease, and also 
lessening the severity or frequency of symptoms of the disease. The therapy is 
designed to alter (e.g., inhibit or enhance), replace or supplement activity of a 
protease polypeptide in an individual. For example, a protease therapeutic agent can 

25 be administered in order to upregulate or increase the expression or availability of 
the protease gene or of specific splicing variants of protease, or, conversely, to 
downregulate or decrease the expression or availability of the protease gene or 
specific splicing variants of the protease. Upregulation or increasing expression or 
availability of a native protease gene or of a particular splicing variant could 

30 interfere with or compensate for the expression or activity of a defective gene or 

another splicing variant; downregulation or decreasing expression or availability of a 
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native protease gene or of a particular splicing variant could minimize the expression 
or activity of a defective gene or the particular splicing variant and thereby minimize 
the impact of the defective gene or the particular splicing variant. 

The protease therapeutic agent(s) are administered in a therapeutically 
5 effective amount (z. e. , an amount that is sufficient to treat the disease, such as by 
ameliorating symptoms associated with the disease, preventing or delaying the onset 
of the disease, and/or also lessening the severity or frequency of symptoms of the 
disease). The amount which will be therapeutically effective in the treatment of a 
particular individual's disorder or condition will depend on the symptoms and 
10 severity of the disease, and can be determined by standard. clinical techniques. In 
addition, in vitro or in vivo assays may optionally be employed to help identify 
optimal dosage ranges. The precise dose to be employed in the formulation will also 
depend on the route of administration, and the seriousness of the disease or disorder, 
and should be decided according to the judgment of a practitioner and each patient's 
1 5 circumstances. Effective doses may be extrapolated from dose-response curves 
derived from in vitro or animal model test systems. 

In one embodiment, a nucleic acid of the invention {e.g., a nucleic acid 
encoding a protease polypeptide, such as one of SEQ ID NOs:l-47, or a complement 
thereof; or another nucleic acid that encodes a protease polypeptide or a splicing 
20 variant, derivative or fragment thereof, such as a nucleic acid encoding one of SEQ 
ID NOs:48-94) can be used, either alone or in a pharmaceutical composition as 
described above. For example, a protease or a cDNA encoding a protease 
polypeptide, either by itself or included within a vector, can be introduced into cells 
(either in vitro or in vivo) such that the cells produce native protease polypeptide. If 
25 necessary, cells that have been transformed with the gene or cDNA or a vector 
. comprising the gene or cDNA can be introduced (or re-introduced) into an 
individual affected with the disease. Thus, cells which, in nature, lack native 
protease expression and activity, or have mutant protease expression and activity, or 
have expression of a disease-associated protease splicing variant, can be engineered 
30 to express the protease polypeptide or an active fragment of the protease polypeptide 
(or a different variant of the protease polypeptide). In a preferred embodiment, 
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nucleic acid encoding a protease polypeptide, or an active fragment or derivative 
thereof, can be introduced into an expression vector, such as a viral vector, and the 
vector can be introduced into appropriate cells in an animal. Other gene transfer 
systems, including viral and nonviral transfer systems, can be used. Alternatively, 
5 nonviral gene transfer methods, such as calcium phosphate coprecipitation, 

mechanical techniques (e.g., microinjection); membrane fusion-mediated transfer via 
liposomes; or direct DNA uptake, can also be used. 

Alternatively, in another embodiment of the invention, a nucleic acid of the 
invention; a nucleic acid complementary to a nucleic acid of the invention; or a 

10 portion of such a nucleic acid (e.g., an oligonucleotide as described below), can be 
used in "antisense" therapy, in which a nucleic acid (e.g., an oligonucleotide) which 
specifically hybridizes to the mRNA and/or genomic DNA of a protease is 
administered or generated in situ. The antisense nucleic acid that specifically 
hybridizes to the mRNA and/or DNA inhibits expression of the protease 

1 5 polypeptide, e.g. , by inhibiting translation and/or transcription. Binding of the 
antisense nucleic acid can be by conventional base pair complementarity, or, for 
example, in the case of binding to DNA duplexes, through specific interaction in the 
major groove of the double helix. 

An antisense construct of the present invention can be delivered, for 

20 example, as an expression plasmid as described above. When the plasmid is 

transcribed in the cell, it produces RNA which is complementary to a portion of the 
mRNA and/or DNA which encodes the protease polypeptide. Alternatively, the 
antisense construct can be an oligonucleotide probe which is generated ex vivo and 
introduced into cells; it then inhibits expression by hybridizing with the mRNA 

25 and/or genomic DNA of the protease. In one embodiment, the oligonucleotide 

probes are modified oligonucleotides which are resistant to endogenous nucleases, 
e.g., exonucleases and/or endonucleases, thereby rendering them stable in vivo. 
Exemplary nucleic acid molecules for use as antisense oligonucleotides are 
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also 

30 U.S. Pat. Nos. 5,176,996; 5,264,564; and 5,256,775). Additionally, general 

approaches to constructing oligomers useful in antisense therapy are also described, 
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for example, by Van der Krol et al (Biotechniques 6:958-976 (1988)); and Stein et 
al. {Cancer Res. 48:2659-2668 (1988)). With respect to antisense DNA, 
oligodeoxyribonucleotides derived from the translation initiation site are preferred. 

To perform antisense therapy, oligonucleotides (mRNA, cDNA or DNA) are 
5 designed that are complementary to mRNA encoding the protease. The antisense 
oligonucleotides bind to protease mRNA transcripts and prevent translation. 
Absolute complementarity, although preferred, is not required. A sequence 
"complementary" to a portion of an RNA, as referred to herein, indicates that a 
sequence has sufficient complementarity to.be able to hybridize with the RNA, 

10 forming a stable duplex; in the case of double-stranded antisense nucleic acids, a 
single strand of the duplex DNA may thus be tested, or triplex formation may be 
assayed. The ability to hybridize will depend on both the degree of complementarity 
and the length of the antisense nucleic acid, as described in detail above. Generally, 
the longer the hybridizing nucleic acid, the more base mismatches with an RNA it 

1 5 may contain and still form a stable duplex (or triplex, as the case may be). One 
skilled in the art can ascertain a tolerable degree of mismatch by use of standard 
procedures. 

The oligonucleotides used in antisense therapy can be DNA, RNA, or 
- chimeric mixtures or derivatives or modified versions thereof, single-stranded or 
20 double-stranded. The oligonucleotides can be modified at the base moiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule, 
hybridization, etc. The oligonucleotides can include other appended groups such as 
. peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 

transport across the cell membrane (see, e.g., Letsinger et al, Proc. Natl. Acad. Sci. 
25 USA 86:6553-6556 (1989); Lemaitre et al, Proc. Natl Acad. Sci. USA 84:648-652 
(1987); PCT International Publication No. WO 88/09810) or the blood-brain barrier 
(see, e.g., PCT International Publication No. WO 89/10134), or 
hybridization-triggered cleavage agents (see, e.g., Krol et al, BioTechniques 
6:958-976 (1988)) or intercalating agents. (See, e.g., Zon, Phann. Res. 5:539-549 
30 (1988)). To this end, the oligonucleotide may be conjugated to another molecule 
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(e.g., a peptide, hybridization triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent). 

The antisense molecules are delivered to cells which express protease in vivo. 
A number of methods can be used for delivering antisense DNA or RNA to cells; 
5 e.g. , antisense molecules can be injected directly into the tissue site, or modified 
antisense molecules, designed to target the desired cells (e.g.> antisense linked to 
peptides or antibodies that specifically bind receptors or antigens expressed on the 
target cell surface) can be administered systematically. Alternatively, in a preferred 
embodiment, a recombinant DNA construct is utilized in which the antisense 

10 oligonucleotide is placed under the control of a strong promoter (e.g., pol HI or pol 
II). The use of such a construct to transfect target cells in the patient results in the 
transcription of sufficient amounts of single stranded RNAs that will form 
complementary base pairs with the endogenous protease transcripts and thereby 
prevent translation of the protease inRNA. For example, a vector can be introduced 

1 5 in vivo such that it is taken up by a cell and directs the transcription of an antisense 
RNA. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors 
can be constructed by recombinant DNA technology methods standard in the art and 
described above. For example, a plasmid, cosmid, YAC or viral vector can be used 

20 to prepare the recombinant DNA construct which can be introduced directly into the 
tissue site. Alternatively, viral vectors can be used which selectively infect the 
desired tissue, in which case administration may be accomplished by another route 
(e g -, systemically). 

Endogenous protease expression can also be reduced by inactivating or 

25 cc knocking out 5 ' protease or its promoter using targeted homologous recombination 
(e.g., see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 
51:503-512 (1987); Thompson etal, Cell 5:313-321 (1989)). For example, a 
mutant, non-ftmctional protease (or a completely unrelated DNA sequence) flanked 
by DNA homologous to the endogenous protease (either the coding regions or 

30 regulatory regions of protease) can be used, with or without a selectable marker 
and/or a negative selectable marker, to transfect cells that express the protease in 
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vivo. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the protease. The recombinant DNA constructs can be 
directly administered or targeted to the required site in vivo using appropriate 
vectors, as described above. Alternatively, expression of non-mutant proteases can 
5 be increased using a similar method: targeted homologous recombination can be 
used to insert a DNA construct comprising a non-mutant, functional protease, e.g., a 
gene having one of SEQ ID NOs: 1-47, or the complement thereof, or a portion 
thereof, in place of a mutant protease in the cell, as described above. In another 
embodiment, targeted homologous recombination can be used to insert a DNA 

10 construct comprising a nucleic acid that encodes a protease polypeptide variant that _ 
* differs from that present in the cell. 

Alternatively, endogenous protease expression can be reduced by targeting 
deoxyribonucleotide sequences complementary to the regulatory region of a protease 
(i.e., the protease promoter and/or enhancers) to form triple helical structures that 

15 prevent transcription of the protease in target cells in the body. (See generally, 

Helene, C, Anticancer Drug Des., 6(6):569-84 (1991); Helene, C. et al.,A?zn. M.Y. 
Acad. Set 660:27-36 (1992); andMaher,L. J.,Bioassays 14(12):807-15 (1992)). 
Likewise, the antisense constructs described herein, by antagonizing the normal 
biological activity of one of the protease proteins, can be used in the manipulation of 

20 tissue, e.g., tissue differentiation, both in vivo and for ex vivo tissue cultures. 

Furthermore, the anti-sense techniques (e.g., microinjection of antisense molecules, 
or transfection with plasmids whose transcripts are anti-sense with regard to a 
protease mRNA or gene sequence) can be used to investigate the role of one or 
protease in developmental events, as well as the normal cellular function of the 

25 proteases in adult tissue. Such techniques can be utilized in cell culture, but can also 
be used in the creation of transgenic animals. 

In yet another embodiment of the invention, other protease therapeutic agents 
as described herein can also be used in the treatment or prevention of a susceptibility 
to a disease or condition associated with a protease. The therapeutic agents can be 

30 delivered in a composition, as described above, or by themselves. They can be 

administered systemically, or can be targeted to a particular tissue. The therapeutic 
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agents can be produced by a variety of means, including chemical synthesis; 
recombinant production; in vivo production (e.g., a transgenic animal, such as U.S. 
Pat. No. 4,873,3 16 to Meade et a/.), for example, and can be isolated using standard 
means such as those described herein. 
5 A combination of any of the above methods of treatment (e.g. , administration 

of non-mutant protease polypeptide in conjunction with antisense therapy targeting 
mutant protease niRNA; administration of a first splicing variant encoded by a 
protease in conjunction with antisense therapy targeting a second splicing encoded 
by a protease), can also be used. 

1 0 The teachings of all publications cited herein are incorporated herein by 

reference in their entirety. 

While this invention has been particularly shown and described with 
references to preferred embodiments thereof, it will be understood by those skilled in 
the art that various changes in form and details may be made therein without 

1 5 departing from the scope of the invention encompassed by the appended claims. 
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Appendix I 

ctgchrl lqLl MOOSE13873 7240429.. 7240464, 7242698..7242815, 
7243685..7243803, 72633 11..7263510, 726S776..7265892, 7267026..7267371, 
7268489..726863S LVDEQPLENYLDMEYFGTIGIGTPAQDFTVWDTGSSNL 
5 WWSWCSSLACTNHbmFNPEDSSTY^ 

VGGISDTNQIFGI^ETEPGSFLYYAPFDGILGI^^ 
LVSQDLFSVYLSADDKSGSVWGGroSSYYTGSmWW 
ITMNGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDVSPAPT 
ALFYTQWGWGRSDENPSNFSHPHSFQMWSCSAISSLPDIVF^ 
1 0 PPS AYILQSEGSCISGFQGMlSrvnPTESGELWrLGD VFIRQYFTVEDRANNQVGL 
APVA (SEQIDNO: 48) 

ctggtagatgaacagcccctggagaactacctggatatggagtacttcggcactatcggcatcggaactcctgcccagga 
tttcaccgtcgtctttgacaccggctcctccaacctgtgggtgccctcagtctactgctccagtcttgcctgcACCaacca 
caaccgcttcaaccctgaggattcttccacctaccagtccaccagcgagacagtctccatcacctacggcaccggcagca 
15 tgacaggcatcctcggatacgacactgtccaggttggaggcatctctgacaccaatcagatcttcggcctgagcgagacg 
gaacctggctccttcctgtattatgctcccttcgatggcatcctggggctggcctaccccagcatttcctcctccggggccac 
acccgtctttgacaacatctggaaccagggcctggmctcaggacctcttctctgtctacctcagcGCCgatgacaaga 
gtggcagcgtggtgatctttggtggcattgactctt^ 

actggcagatcaccgtggacAGCatcaccatgaacggagagaccatcgcctgtgctgagggctgccaggccattgtt 
20 gacaccggcacctctctgctgaccggcccaaccagccccattgccaacatccagagcgacatcggagccagcgagaac 
tcagatggcgacgtgagtccagccccgactgccctgttctacactcaagtagtgggtgtgccaggcagaagcgacgaaa 
acccttctaacttttctcaccctcactctttccagatggtggtcagctgctcagccatcagcagcctgcccgacatcgtcttca 
ccatcaatggagtccagtaccccgtgccacccagtgcctacatcctgcagagcgaggggagctgcatcagtggcttcca 
gggcatgaacgtccccaccgaatctggagagctttggatcctgggtgatgtcttcatccgccagtactttaccgtcttcgac 
25 agggcaaacaaccaggtcggcctggcccctgtggct (SEQ ID NO: 1) 

ctgchrl lq_l MOOSE13874 7136792..7136938, 7138056..7138401, 
7139535..7139651, 7141003..7141202, 7141878. .7141996, 7142867..7142984, 
7145221. .7145256 LVDEQPLENYLDMEYEGTIGIGTP AQDFTVLFDTGS SNL 
WVPSVYCSSLACTNHNRFMPEDSSTYQSTSEWSITYGTGS 
30 VGGISDTNQIFGLi>ETEPGSFLYYAP 

LVSQDLFSVYI^ADDKSGSVVlFGGroSSYYTGSLNWWVT^ 
ITIVD^GETIACAEGCQAIVDTGTSLLTGPTSPIAMQSDIGASENSDGDV 
ALFYTQWGWGRSDENPSNFSHPHSFQMWSCSAISSLPDIW 
PPSAYILQSEGSCISGFQGMNVPTESGELWILGDWIRQYFTVro 

35 APVA (SEQ ED NO: 49) 

ctggtagatgaacagcccctggagaactacctggatatggagtacttcggcactatcggcatcggaactcctgcccagga 

tttcactgtcctcmgacaccggctcctccaacc^^ 

aaccgcttcaaccctgaggattcttccacctaccagtccaccagcgagacagtctccatcacctacggcaccggcagcat 
gacaggcatcctcggatacgacactgtccaggttggaggcatctctgacaccaatcagatcttcggcctgagcgagacgg 

40 aacctggctccttcxtgtattatgctcc^ 

cccgtcmgacaacatctggaaccagggcctggtttctcaggacctcttctctgtctacctcagcGCCgatgacaagagt 

ggcagcgtggtgatctttggtggcattgactcttcttactacactggaagtctgaactgggtgcctgtto 

ctggcagatcacxgtggacAGCatcaccatga^ 

acaccggcacctctctgctgaccggcccaaccagccccattgccaacatccagagcgacatcggagccagcgagaac^ 
45 cagatggcgacgtgagtccagcccxgactgccctgttctacactcaagtagtgggtg^gccaggcagaagcga 
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cccttctaacttttctcaccctcactctttccagatggtggtcagctgctcagccatcagcagcctgcccgacatcgtcttcac 
catcaatggagtccagtaccccgtgccacccagtgcctacatcctgcagagcgaggggagctgcatcagtggcttccag 
ggcatgaacgtccccaccgaatctggagagctttggatcctgggtgatgtcttcatccgccagtactttaccgtcttcgaca 
gggcaaacaaccaggtcggcctggcccctgtggct (SEQ ID NO: 2) 
5 ctglOOOS MOOSE13895 396108.396195, 396236.396353, 396946.397108, 
397589.397813, 398026.398172, 398273.398362, 420625..420664, 
456670..456713 DKAWLKRGNKQFNEGKESDR^ 

NQEYSQGKHSFTMAMNAFGDMTNEEFRQVMNGFQYQKHRKGKQFQERLL 
LEIPTSVDWREKGYMTPVKDQGQCGSCWAFSATGALEGQMFWKTGKLISL 
1 0 NEQNLVDCSGPQGNEGCNGDFMDNPFRYVQENGGLDSEASYPYEGKVKTC 
RYNPKYSAANDTGFVDIPSREKD1AKAVATVGPISV 
YFEPRCDPEGLDHAMLWGYSYEGADSDNNKYWLVKNM 
NHISEIPEVFFPLRQ (SEQ ID NO: 50) 

gacaaggcatggttgaaaagagggaacaaacagttcaatgaaGGAaaggaatctgatagatgtctgatatttaaatgta 
15 aaaataaggacgtgaagatgattgagcagcacaatcaggaatacagccaagggaaacacagcttcacaatggccatgaa 
cgcctttggagacatgaccaatgaagaattcaggcaggtgatgaatggttttcaataccagaagcacaggaaggggaaac 
agttccaggaacgcctgcttcttgagatccccacatctgtggactggagagagaaaggctacatgactcctgtgaaggatc 
agggtcagtgtggctcttgttgggcttttagtgcaactggtgctctggaagggcagatgttctggaaaacaggcaaacttat 
ctcactgaatgagcagaatctggtagactgctctgggcctcaaggcaatgagggctgcaatggtgacttcatggataatcc 
20 cttccggtatgttcaggagaacggaggcctggactctgaggcatcctatccatatgaaggaaaggttaaaacctgtaggta 
caatcccaagtattctgctgctaatgacactggttttgtggacatcccttcacgggagaaggacctggcgaaggcagtggc 
aactgtggggcccatctctgttgctgttggtgcaagccat 

gctgtgaccctgaaggcctggatcatgctatgctggtggttggctacagctatgaaggagcagactcagataacaataaat 
attggctggtgaagaacAGGgaatcctttfflgcaatoagtatto^ 

25 -gaagtctttttcccccttaggcag (SEQ ID NO: 3) 

ctgChr_13ctg3 MOOSE13908 31779459.31779493, 31784161.31784194, 

31786584.31786698, 31787274.31787332, 31787822.31787839, 
31788078.31788126, 31796102.31796145, 31796615.31796690, 
31806469.31806527,31807429.31807509 

30 SNVKPRKKNPKGFTVNVL^ 

SASNSCGTISNLGSIENGTNEGKYKINFILKVILK^ 
CLHCTVETSDYQVQAPSIDEKVDLHF3ALYHVDGHLYELDG 
SDETLLEDAIEVCKKFMERDPDELRFNAIALSAA (SEQ ID NO: 51) 
tcaaatgtgaagccaagaaagaaaaatccaaaaGGGttcactgttaatgtcctggcaaacattggtgtcataaatcactg 

35 gaaattacccaaacttactcacaaaggggaaaaaaatagtggaaaagagggggaaaaaccacacaaaacaaacagtgc 
atccaacagctgtgggacaatttcaAATttaggaagcattgaaaatggaacaaatgaaggtaaatataaaataaatm 
cttaaagtcamtaaaaaacgcaaacactgttagaactgaaaagagaaatagacaaattcacaattatAACtgttta 
gcactgttgaaacatcagattatcaagttcaagcaccaagtatagatgaga^^ 
agatgggcatctctatgaattaGATgggcggaagccamc^ 

40 ggatgcxatagaagmgcaagaagtttatggagcgcgaccctgatgaactaagatttaatgcgattgcto 

(SEQ ID NO: 4) 

ctgl5ctg27 MOOSE13930 800712..800756, 825940..826087, 826815..827041, 
828426..828548, 855490..855558 

ACLVSGFGPFRQHLVNSSWEAVKELSKIXjLGNETVVQLRT^ 
45 RRVTGIWEDHQPQLVVHVGMDTAAKAn^ 

VCLPGSPDVI^SGVCMKAVCXRVAVEGVDWSRD^^ 
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HGKGCAALIir/PPLSRGLP^ 
(SEQ ID NO: 52) 

gcatgtttggtttcaggttttgggcccttccggcagcacttggtgaattccagctgggaagcagtgaaggagctctccaagc 
tgggcctggggaatgaaacagtggtgcagctgcggactctggagctgcctgtagattacagggaggctaagcggaggg 
5 tcaccggaatctgggaagatcatcagccgcaactcgtcgtgcatgtgggcatggacaccgccgccaaggcgatcattctg 
gaacagtctggcaagaaccaaggctaccgggacgccgacatccgcagcttctggcccgagggcggcgtgtgcctacct 
ggcagcccagacgtgctggagtcaggggtctgcatgaaggcagtctgcaagcgcgtagctgtggagggtgtcgacgtg 
atcttttcccgagatgcaggcAGAtacgtctgtgattatacctattacctgtctctgcatcatggaaagggctgcgcggca 
ctcatccatgtccctccactatcgcgcgggctcccggccagcctgctgggaagagccttgagagtcatcatccaggaaat 
1 0 gctggaagaggcaggggagaaacaaaaagaggtgacggcttctggcacatcccac (SEQ ID NO: 5) 
ctgChr_Xctg264 MOOSE13940 1447207.. 1448280 
RIGKSVYTVGLRG 

CLVCEMSSLFHAMYSGSRTPHIPYKLLHLIWIHAEHLA 

DVLHRHSKDDSGGQEANNPNCCNCHDQIFTGGLQSDVTCQACHSVSTTIDP 
1 5 CWDISLDLPGSCATFDSQNPERADSTVSRDDHIPGIPSLTDCLQWFTRPEHLG 
SSAKIKCNSCQSYQESTKQLTMKKL^^ 
LEIXMTPFLASTKESRMKEGQPPTO 

F1RQQKDQWFSCDDAIITKATIEDLLYSEGYLLFYHKQGLEK (SEQ ID NO: 53) 
agaaaaaagtcagtctatactgtaggcctgagagggctaatcaatcttgggaacacttgttttatgaattgtattgt 

20 ccaggcacttacccatattcctctactgaaagatttcttcctctctgacaagcacaaatgtataatgacaagccccagcttgtg 
tctggtctgtgaaatgtcttcgcttmcatgctatgtactctgggagccgaactcctcacattccctataagttactgcato 
atggatccatgcagaacatttagcagggtacaggcagcaggatgcccatgagttccttattgcaatattagacgtgctacat 
agacacagcaaagatgatagtggtgggcaggaggccaataaccccaactgctgtaactgcatcatagaccaaatctttac 
aggtggcctgcaatcagatgtcacatgtcaagcctgccatagtgtttctaccaccatagacccatgctgggacatcagtttg 

25 gacttgcctggctcttgtgccacattcgattcccagaacccagagagggctgacagcacagtgagcagggatgaccacat 
accaggaatcccctcacttacagactgtctacagtggtttacaaggccagagcacctaggaagcagtgccaaaatcaaat 
gcaatagttgccaaagctaccaggagtctactaaacagctcacaatgaaaaaattacccattgtggcttgttttcatctcaag 
cggmgagcatgtaggcaaacagaggcgaaagattaataccmatctccmcccttggagctggacatgactccgtttttg 
gcctctactaaagagagcagaatgaaagaaggccagccaccaacagattgtgtgcccaatgagaataagtattccttgttt 

30 gcagtgattaatcaccatggaactttggaaagtggccactataccagcttcatccggcaacaaaaggaccagtggttcagc 
tgtgatgatgccatcatcaccaaggctaccattgaggacttactctacagtgaagggtatttactgttctatcacaaacaggg 
tctagagaaa (SEQ ID NO: 6) 

ctgChr_Xctg264 MOOSE13941 1564654.. 1565727 

RJOCSVYTVGLRGLINLGN^ 
35 CLVCEMSSLFHAMYSGSRTPHIPYKLL^ 

DVLHRHSKDDSGGQEANNPNCCNCITOQIFTGGLQSDVTCQACHSVSTTro 

CWDISLDLPGSCATFDSQNPERADSWSRDDH1PGIPSLTO 

S S AKIKCNS CQS YQESTKQLTMKKXPIV ACT 

LELDMTPFIJ^TKESRMKEGQPPTDCW 
40 FIRQQKDQWSCDDAinXATffiDLLYSEGYLIJ^YH^ (SEQ ID NO: 54) 

agaaaaaagtcagtctatactgtaggcctgagagggctaatc^ 

ccaggcacttacccatettcctctactgaaagatto^ 

tctggtctgtgaaatgtcttcgctttttcatgrt^^ 

atggatccatgcagaacamagcagggtacaggcagcaggatgcccatgagttcctottgcaatattagacgtgc^ 
45 agacacagcaaagatgatagtggtgggcaggaggccaataaccccaactgctgtaactgcatcatagaccaaatctt^ 
aggtggcctgcaatcagatgtcacatgfcaagcctgc^a^ 
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gacttgcctggctcttgtgccacattcgattcccagaacccagagagggctgacagcacagtgagcagggatgaccacat 
accaggaatcccctcacttacagactgtctacagtggtttacaaggccagagcacctaggaagcagtgccaaaatcaaat 
gcaatagttgccaaagctaccaggagtctactaaacagctcacaatgaaaaaattacccattgtggcttgttttcatctcaag 
cggtttgagcatgtaggcaaacagaggcgaaagattaatacctttatctcctttcccttggagctggacatgactccgtttttg 
5 gcctctactaaagagagcagaatgaaagaaggccagccaccaacagattgtgtgcccaatgagaataagtattccttgttt 
gcagtgattaatcaccatggaactttggaaagtggccactataccagcttcatccggcaacaaaaggaccagtggttcagc 
tgtgatgatgccatcatcaccaaggctaccattgaggacttactctacagtgaagggtatttactgttctatcacaaacaggg 
tctagagaaa (SEQ ID NO: 7) 

ctgChr_Xctg26 MOOSE13943 4457988..4459058 

10 JOTS SFTIGLRGLINLGNTCFMN^ SPELCL 
VCEMSSLFRELYSGNPSPHWYKLLHLVWmAWHDLAG 
VLHRHCKGDDVGKAANNPNMCNCIIDQOT 

WDISLDLPGSCTSFWPMSPGRESSVNGESHIPGITTLTDCLRRFrR^ 
KIKCGSCQSYQESTKQLTMNKUPVVACFHFKRFEHSAKQ 

15 DMTPFMAS SKESRMNGQLQLPTNS GNNENKYSLF AVVNHQGTLES GHYTSF 
IRHIIKI)QWFKCDDAVITKASIKDVLDSEGY^ (SEQ ID NO: 55) 

agaatcacctccagctttacgatcggtttaagaggactcatcaatcttggcaacacgtgctttatgaactgcattgt 
ccaggccctcacccacacgccgatactgagagatttctttctctctgacaggcaccgatgtgagatgccgagtcccgagtt 
gtgtctggtctgtgagatgtcgtcgctgtttcgggagttgtattctggaaacccgtctcctcatgtgccctataagttactgcac 

20 ctggtgtggatacatgcccgccatttagcagggtacaggcaacaggatgcccacgagttcctcattgcagcgttagatgtc 
ctgcacaggcactgcaaaggtgatgatgtcgggaaggcggccaacaatcccaaccactgtaactgcatcatagaccaaa 
tcttcacaggtggcctgcagtctgatgtcacctgtcaagcctgccatggcgtctccaccacgatagacccatgctgggaca 
ttagtttggacttgcctggctcttgcacctccttctggcccatgagcccagggagggagagcagtgtgaacggggaaagc 
cacataccaggaatcaccaccctcacggactgcttgcggaggtttacgaggccagagcacttaggaagcagtgccaaaa 

25 tcaaatgtggtagttgccaaagctaccaggaatctaccaaacagctcacaatgaataaattacctgtcgttgcctgttttcattt 
caaacggtttgaacattcagcgaaacagaggcgcaagatcactacatacatttcctttcctctggagctggatatgacgccg 
tttatggcctcaagtaaagagagcagaatgaatggacaattgcagctgccaaccaatagtggaaacaacgaaaataagtat 
tccttgtttgctgtggttaatcaccaaggaaccttggagagtggccactataccagcttcatccggcaccacaaggaccagt 
ggttcaagtgtgatgatgccgtcatcactaaggccagtattaaggacgtactggacagtgaagggtatttactgttctatcac 

30 aaacaggtgctagaa (SEQ ID NO: 8) 

ctg8ctg6 MOOSE13948 967504..968427 

LSSRRPAAVGAGLQNMGNTCYENASLQCLTYTPPLANYMLSREHSQTCQRP 
KCCMLCTMQAHi™ 

AMKKACLPGHKQVDHHSKDTTLIHQIFGGCWRSQIKCIJH 
35 IALDIQAAQSVKQALEQLVKPEELNGENAYHCGLCLQRAPASKTLTIJHTSAK 
VLILVLKRFSDVTGNKLAKNVQYPECLDMQPYM 
HAGWSCHDGHYFSYVKAQEGQWYKMDDAKVTACS 
KSE (SEQ ID NO: 56) 

ctgagtagcaggagacctgctgcggtgggggctgggctccagaatatgggaaatacctgctacgagaacgcttccctgc 
40 agtgcctgacatacacaccgccccttgccaactacatgctgtcccgggagcactctcaaacatgtcagcgtcccaagtgct 
gcatgctctgtactatgcaagctcacatcacatgggccctccacagtcctggtcatgtcatccagccctcacaggcattggc 
tgctggcttccatagaggcaagcaggaagatgcccatgaatttctcatgttcactgtggatgccatgaaaaaggcatgcctt 
cccggccacaagcaggtagatcatcactctaaggacaccaccctcatccaccaaatatttggaggctgctggagatctca 
aatcaagtgtctccactgccacgggatttcagacacttttgaccx^ttacctggacatcgccctggatatccaggcagctcag 
45 agtgtcaagcaagctttggaacagttggtgaagcccgaagaactcaatggagagaatgcctatcattgcggtctttgtctcc 
agagggcgccagcctrcaagacgttoactttacacacttctgccaaggtc^^ 
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acaggcaacaaacttgccaagaatgtgcaatatcctgagtgccttgacatgcagccatacatgtctcagcagaacacagg 
acctcttgfctatgtcctctatgctg^ 

caagaaggccagtggtataaaatggatgatgccaaggtcactgcctgtagcatcacttctgtcctgagtcaacaggcctat 
gtcctcttttacatccagaagagtgaa (SEQ ID NO: 9) 
5 ctgChr_lctg34 . MOOSE13952 2S934534..28934662, 28935089..28935291, 
28936850..28937023, 28937998..28938168, 28940593..28940826, 
28943293..2894331 1, 28943898..28943978 
CEKRENLLPFVGLNNTL^ 
LNPMYEGYLQETOAQEVLQ 
1 0 KEEMNGEEQIGFELVEKLFQGQLVLRTRCLECESLTEIUREDFQDIS VPVQEDE 
J^KVEESSEKMKTLRWAISQFASVERW 

MPEVITIHLKCFAASGNLFFFSSQRFDCYGGGLSKINTPLLTPLKLSLEEW 

PTNDSYGLFAVVMHSGrTIS 

NSLSPSTSPTSTPYLLFYKK (SEQ ID NO: 57) 

15 tgtgagaagagagaaaacttgttaccatttgtgggactgaataatctcggcaatacttgctatcttaatagtatacttcaggtat 
tatatttttgtcccGGCtttatgtactgtatcttcaaaactagaatagatgaaatggaaatttttatttatagggaactcaaccct 
atgtatgaaggatatctacagcatgatgcacaggaagtattacaatgtattttgggaaacattcaagaaacatgccaactcct 
aaaaaaagaagaagtaaaaaatgtggcagaattacctactaaggtagaagaaatacctcatccgaaagaggaaatgaat 
GGTgaagaacaaattggttttgagctagtggagaaattatttc^^ 

20 gaaagtttaacagaaagaagagaagattttcaagacatcagtgtgccagtacaagaagatgagctttccaaagtagagga 
gagttctgaaAAAatgaagaccctgagatgggcaatttcacaatttgcttcagtagaaaggattgtaggagaagataaat 
atttctgtgaaaactgccatcattatactgaagctgaacgaagtcttttgtttgacaaaatgcctgaagtetaactatt^ 
aagtgcmgctgctagtGGAaacctmctttttttcctcccaaaggtttgattgttatggtggtgg 
ctcctttattgacacctctlaaattgtcactagaagaatggagcacaaagccaactaacgacagctatggattatttgcggttg 

25 tgatgcatagtggcattacaattagtagtgggcattacactgcttctgttaaagagtatgaggggaagtggttgctttttgatg 
attctgaagtcaaagttactgaagagaaggactttctgaattctcWccccttctacatctcctacttctactcctt^ 
ttataagaaa (SEQ ID NO: 10) 
ctg8ctg4 MOOSE13954 6009332..6010255 
LSSRIUPAAVGAGLQNMGNTCYE 

30 KCCMLCTMQAfflTWALHSPGHVIQPSQAlAAGFHRGKQEDV^ 
AMKKACLPGHKQVDHHCKDTTLIHQIFGGCWRSQ1KCLH 
DIALDIQAAQSVKQALEQLVI<PEELNGENAYHCGLCLQRAPASOT 
KVLILVLKRFSDVAGNKLAKNVQYPECLDMQPYM 
VHAGWSCHDGHYFSYVKAQEVQWYKMDDAEVTV 

35 QKSE (SEQ ID NO: 58) 

ctgagtagcaggagacctgctgcggtgggggctgggctccagaatatgggaaatacctgctacgagaacgcttccctgc 
agtgcctgacatacacactgccccttgccaactacatgctgtcccgggagcactctcaaacatgtcagcgtcccaagtgct 
gcatgctctgtactatgcaagctcacatcacatgggccctccacagtcctggccatgtcatccagccctcacaggcattgg 
ctgctggcttccatagaggcaagcaggaagatgtccatgaatttctcatgttcactgtggatgccatgaaaaaggcatgcct 

40 t<xx>ggc^acaagcaggtagatcatcactgcaaggacaccaccctcatccaccaaatatttggaggctgctggagatctc 
aaatcaagtgtctccactgcca^gggatttcagacacttttgacccttacctggacatcgccctggatatccaggcagctca 
gagtgtcaagcaagctttggaacagttggtgaagcccgaagaactcaa 

cagagggcgccggcxjtccaacacgttaacmacacacttctgccaaggtcctcatccttgtcttgaagagatt^ 
cgcaggcaacaaacttgccaagaatgtgcaatatcctgagtgccttgacatgcagccatacatgtctcagcagaacacag 
45 gacctettgtctatgtcctctatgctgtgrt 

ctcaagaagtccagtggtaiaaaatggatgatgccgaggtcact^ 
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gtcctcttttacatccagaagagtgaa (SEQ ID NO: 11) 

ctg21fin2 MOOSE13975 2828213..2828293, 2836386..2836472, 
2845420..2845514,2846133..2846213,2860640..2860778, 2868030..2868242 
YDRKRQDKAPVGLKNVGNT 
5 PRNQKAFFPSQQDVSEFTHKLLDW^ 

GEDMKWYSHCGKHFCYCHSFQHWFTELPPVLTFELSKFEFNQALGRPE 
NKI^EFPQWYRLHAVLVHEGQANAGHYWAYIF 
SWEELVIU3SFGGYRNASAYCLMYIh^ (SEQ ID NO: 59) 
tatgatagaaaaagacaggacaaagcjtcccgttgggctaaagaatgttggcaatacttgttggtttagtgctgttattcagt^ 
10 attatttaatctmggaatttagaagattagttctgaattac^ 
gcttttttctmcacagcaagatgtgagtgagmacacac 

aagaggagACGgttggcaaggatgtggagaaattgaaacccttgtgcagtgttggtgaggatatgaaatggtacagcc 
actgtggaaaaCATmtgttattgttttatttcctttcagcattggtttactgaattaccacctgtgttaacattt 
gatttgaatttaatcaggcattgggaagaccagaaaaaattcacaacaaatta^ 
15 ccgttttagttcacgaaggccaagctaatgctgggcactactg^ 

tacaatgatattgctgtgacaaaatcatcatgggaagagctagtgagggactcttttggtggttatagaaatgccagtgcata 

ctgtttaatgtacataaatgataaggca (SEQ ID NO: 12) 

ctgchrllq_5 MOOSE13977 18732295..18732375, 18738638..18738724, 
18739545.. 18739553, 18740980.. 18741048, 18743645..18743770, 
20 18745607.. 18745668, 18792900..1 8792942, 18807171.. 18807377 
NDWRRVDGWPVGLJGW 

CRSHTEQQQDVSEFTHKIXDWLEDAFQ]^^ 

GAMVEGDVELLPSDHSVKYGQERWFTKLPPVLTFEI^RFEFNHSW 

SK^HTWYRLHAVLVlffiGQANAGH^ 

25 WEEVERDSYGGLRNVSAYCLMYIND (SEQ ID NO: 60) 

aatgactggaggagagttgatggttggccagttgggctgaaaaatgttggcaatacatgttggmagtgctgttattcagtc^ 
ctcmcaattgcctgaamcgaagacttgttctcagttatagtctgccacaaaatgtacttgaaaattgtcgaagt^ 
acagcagcaagatgtgagtgaattcacacacaagctcctggattggctagaggacgcattccagctagctgttaataccttc 
ggccagtatcctcttcaggtaaacggttatcgcaacttagacgagtgtttggaaggggccatggtggagggtgatgttgag 

30 cttcttccctccgatcactcggtgaagtatggacaagagcgtt^ 

agatttgagtttaatCACtcctggggcagagacaagaaggattcaaaggccttgcacacagtgccttatcgcttgcatgc 
agttcttgttcatgaaggacaagcaaatgctggacactattgggcctatatctataatcaaccccgacagagctggctcaag 
tocaatgacatctctgttactgaatcttcctggg 
tgtctgatgtacattaatgac (SEQ ID NO: 13) 

35 ctgl7005 MOOSE13980 1430240.. 1430370, 1438626.. 1438974, 

1439081..1439207, 1445603. .1445624, 1451804..1451827, 1452650.. 1452714, 
1473305..1473354 SGSSPSSSWPSGIJ^SCPIFQCLFMLHLLSRSQSFLWPRV 
RMRRQHGALEFHRVLFGSLQEERAQDADSVWQQQQAHQQHSCTLDECFQF 
YTKEEQLAQDDAWKCTHCQVLQQGMV^ 

40 NKLSTLVKFPLS GLNMAPHV AQRSTSPEAGLGP WPS WKQPDCLPTS YTLDFL 
YDLYAVCNHHGNLQGGITYrAYCRNSIJ)GQ 
GAYILFYQKRNSIP (SEQ ID NO: 61) 

tctggaagctctccttccagttcatggccttctggcctccggtcgagcTGTcccattmcaatgtttgtttatgttgcatttgc 
tttcgaggtctcagtcattcctctggccaAGAgtgaggatgaggaggcagcatGGTgctctggaatttcatagaGT 
45 Cctgttcgggagcctccaggaggagcgagcgcaggatgccgacagtgtgtggcagcagcagcaggcgcatcagcag 
cacagctgtaccttggatgaatgttttcagttctacaccaaggagg^cagctggcccaggatgacgcctggaagtgtcct 
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cactgccaagtcctgcagcaggggatggtgaagctgagtttgtggacgctgcctgacatcctcatcatccacctcaaaag 
gttctgccaggtgggcgagagaagaaacaagctctccacgctggtgaagtttccgctctctggactcaacatggctcccc 
atgtggcccagagaagcaccagccctgaggcaggactgggcccctggccttcctggaagcagccggactgcctgccca 
ccagttacccgctggacttcctgtacgacctgtatgccgtctgcaaccaccatggcaacctgcaaggtgggcattacaca 
5 GCCtactgccggaactctctggatggccagtggtacagttatgatgacagcacggtggaaccgcttcgagaagatgag 
gtcaacaccagaggggcttatatcctgttctatcagaagcggaacagcatccct (SEQ ID NO: 14) 
ctgCHR12_ll ^ MOOSE13982 65213..65322, 76770..76857, 78921. .78986, 
80779..80841, 88642..88701, 89241..89327, 90016..90233, 92975..93203 
LPPAFFLGLWGLVNLGNTCF^ 

1 0 AQEGIHLYRQQDAHELFHVTTSSLEDEI^ 
HQSPVRFDTFDSLSLSIPAATWGHPLTLDH^ 
TFVKQLKLGKVSPHYTPCWLCFEDSV^ 
RLSWSSHGTPLKRHEHVHSSTYLFRLMAVVVHHGDMH 
RNPLSTSNQWLWVSDDTVRKASLQEVLSSSAYI.LFYERVLSRM (SEQ ID 

15 NO: 62) 

cttccccctgcttttttmagggcttgtgcctggccttgttaatttagggaacacctgctt^ 
gtctgcctgtcctgctttcatcAGCcttg^ 

agggaattcatttatataggcaacaagatgctcacgaattattccatgtcattacctcgtcattggaagatgagcgagaccgc 
cagcctcggtctcaacatccttttcatggaagactcactagtaatatggtctgcaaacactgtgaacaccagagtcctgttcg 
20 amgatacctttgatagcctttcactaagtattccagccgccacatggggtcacccattgaccctggaccactgccttcacca 
cttcatctcatcagaatcagtgcgggatgttgtgtgtga 

ggaaggtgagcccacactacacaccctgttggctttgttttgaggactccgtgtatcctgcccctgaaacaactcggttctcc 
cgatttctcttccacccgcagctccctcagtgtctctgcatccacctacagcggctgagctggtccagccacggcacgcct 
ctgaagcggcatgagcacgtgCACtcctccacatacctcttccggctgatggcagttgtcgtccaccatggagacatgc 
25 actctggacacmgtcactmccgacggtccccaccttctgccaggaaccctctctcaactagcaatcagtggctgtgg^ 
ctccgatgacactgtccgcaaggccagcctgcaggaggtcctgtcctccagcgcctacctgctgttctacgagcgcgtcct 
ttccaggatg (SEQ ID NO: 1 5) 

ctg2013 MOOSE14044 1981177..1981331, 1981824.. 1981954, 

1982040.. 1982121, 1984518..1984699, 1985403..1985520, 1985943..1985983, 
30 2073983..2074221 MLSSPDFYPAYPSAMQSI^LGSLARALEmTQYFNNW 

NWVYDNIID QNESKLSKSRREEIERDRERKERREGDREKKRQNGVVE W 

SSKYDEPSRQV1LEALAEFERSTCIRFVTYQDQRDFISIIPMY 

MQWSLAPTCLQKGRGIVLHELMHV^ 

GFEINFnCSRSSNMLTPYDYSSVMHYGRLAFSR^ 
35 NLSASDITRVLQLYGCSPKXEKVNTVNIKin^LYTQLE 

NKSDSCIKYLRKNE (SEQ ID NO: 63) 

atgctctcctcccx>cgaMctacccagct^ 

ctgatgacccagtacmaacaattggaactgggtcte^ 

ggagagaagagatagagagagacagagagagaaaagagagaagagagggagacagggaaaaaaagagacagAA 
40 TggJgtcgtggaggtccccttcctgctctccagcaagtacGATgagcccagccgccaggtcatcctggaggc 

ggagmgaacgttccacgtgcatcaggmgtcacctatcaggaccagagagacttcatttccatcatccccatg^ 

Gtgcttctcgagtgtggggcgcagtggagggatgcaggtg^ 

cattgtccttcatgagctcatgcatgtgctgggcttrt^ 

caactggaacgagatcctgccaGGCtttgaaatcaactt^ 
45 tactcctctgtgatgcactatgggAGGcttgccttcagccggcgtgggctgcccaccat^ 

gtgtrcacatcggcxagcgatggaacctgagtgccte 
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AAttagaaaaggtaaatacagttaatataaaaatcatatttctgtacacccagttagaaaacacaattgttagtaaacatacc 
attataatagcaatcataaaggtcccaagaaataaatctgacagctgtatcaaatatttgaggaaaaatgaa (SEQ ID 
NO: 16) 

ctgCHR12_3 MOOSE14065 6845696..6845761, 6845907..6846079, 
5 6849800..6849892, 6852604.6852721, 6858283..6858419, 6858523. .6858704, 
6859916..6860061, 6868904..6869011, 6870600..6870741, 6872661. .6872804, 
6874609..6874714, 6896403.. 6896443, 6898513..6898637, 6899178..6899261, 
6908160..6908289, 6919052..6919123, 6920651. .6920743, 6925041. .6925098, 
692634S..6926386, 6929101..6929215 
10 FQLWTWLRPCPVTWffRFPGGGVFPGGSLSPLHILGTKAFFK 
MEHSNDnSPHFKEITQnTSFQEDEEE 
WB^RAKSLYIQILSQRHSRKKEU^YPRYffi 
LTLMSIVATIYKXJPS^ 

QTQNDLDDVHPSHHDTAVLITREDICSSKIEKCNML 

1 5 EEKGLISAFTIAHELGHTLGVQ 

SWSNCSRKYVTEFLDTGYGECLLDKPDEEIYNLPSELPGSRYDGNKQCELAF 
GPGSQMCPHIEMCMHLWCTSTEKLHKGCFTQHWPADGTD 
GLCVNBCETETRPVNGEWGPWEPYSSCSRTCGGGffiSATRRCKF^ 
YCVGRRMKFRSCNTDSCPKGTQDFREKQCSOT 

20 YSGIGTKDRCKLYCQVAGTNYFYLLKDMVEDGTPCGTETHDICVQGQCIV^ 
AGCDHVLNSSAKnDKCGVCGGDNSSCKTITGVFNSSHYGYNVVV^ 
NVDIRQYSYSGQPDDSYLALSDAEGOTLFNGNFLLSTSKKEI^ 
SGSNNAVERtNSTNRQEKELIL^ (SEQ ID 

NO: 64) 

25 ttccagctctggatatggttaaggccttgccctgtgacctggattcccaggtttcctggtgggggtgtatttcctggaggcagt 
ctctcccccttgcacattttggggaccaaaGCAttTO 

catagtaatgacataatcagtccacattttaaagagatcactcagattATCacatcttttcaagaaataatagaggaagagt 
ttggaatatcacaatgctatacatataataatccatctaaatctgacattaggatccatTGGacagtatcagatttgtctcag 
gtmtattctaagtagagccaaaagcctttaMccaa^ 

30 ccaagatacattgaaattatggttacagctgatgctaaagtggtttctgctcatggatcgaatttgcaaaactatatactgactc 
taatgtcaattgttgcaacaatctacaaagatccaagtattggaaatttgatacacatagtagtggtaaaattagttatgatt^ 
ccgtgaggaggaaggaccagtcattaattttgatggtgctaccacattaaagaacttttgttcatggcaacaaactcagaatg 
accttgatgatgttcacccttcccaccatgacactgctgttcttatcactAGGgaagacamgttcatctaaagagaaat^ 
aacatgttaGGTttatcatamaggtaccatatg^gatcctttacaaagctgctttattaatgaagaaaaaggactcatttctg 

35 cmtactatagcccatgagcttgggcacACActtggtgttcaacatgatgataatcctagatgtaaagaaatgaaagttac 
aaagtatcatgtaatggcccctgcmaagtmcacatgagtccttggagctggtcaaactgtagtcggaaatetgtt 
attcctaGATactggttacggggaatgtcttcttgacaaaccagatgaagaaatatataatctgccttcagaacttcctgga 
tcacgatotgatggaaacaagcagtgtgagcttgcgmggtcctgg 

gcatctgtggtgcacaagcacagaaaagcttcacaaaggctgtttcactcaacacgtgccaccagcagatggaacagact 
40 gcggtcctggaatgcattgccgtcatgggctatgtgtaaa^ 

cx^atgggaaccttacagttcttgttcaagaacatgtggaggcggaatcgaaagtgcaaccaggcgctgtaatcgtcctGA 
Gccaagaaacggaggaaattactgtgtgggccgcaggatgaaa 

cacaagacmcgagagaagcagtgctctgattttaatggtaaacamggacatcagtggcattccctctaatgtgagg^ 
cttccaagatacagtggcATTggcacaaaggatcgttgtaaart^ 
45 gaaggatatggttgaagatggtactrcttgtggaactga 

gtgatcacgtgttaaactccagtgccaagatagacaaatgtggagtgtgtggtggggacaactcttcatgcaag^^ 
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caggtgtcttcaacagttctcattatGGTtataatgttgttgtaaagattcccgcaggagcaacaaacgttgacattcgtca 
gtacagctattctggacaaccagatgacagttoccttGCA 

tctaagtacgtcaaaaaaagaaatcaatgtgcaaggaacaagaactgttattgaatacagtggatcaaataacgcagttgaa 
agaattaatagtactaatcgacaagagaaagaacttattttgcaggtg^ 
5 attccttcaatatccctttggaagag 

ctgl0007 MOOSE14068 24799456.-24799495, 24807076..24807253, 
2481 1769..2481 1846, 24812886..24813009, 24813394..24813527, 
24815913. .24816088, 24818085. .24818233, 24820282..24820395, 
24821774..24821906, 24823047..24823190, 24824716..24824821, 
1 0 24826817..24826973, 24827698..24S27777, 24848289.-24848468, 
24848491. .24848535, 24848850..24848955, 248491 53..24849274, 
24849572..24849605 PSLLSCLLSFPRPGPDIAWQLSCKGSWIGTQTHSLTVSS 
ATDFVLRWQMtMVEY^ 

APPVGWTSIJSSCLDLPNLLGLVGDQLGDTERK^ 
1 5 VVT^HGKEHVQNYVLTLMNIVNEIYHDESLGVH^ 

ERGNPSRSLEQVCRWAHSQQRQDPSHAEHHDHVWLTRQDFGP 

PVTGMCHPLRSCALNHEDGFSSAFVIAHETGHVLGMEHDGQGNGCADE 

GSVMAPLVQAAFHRFITWSRCSKLELSRYLPSTO 

GINYSMDEQCRFDFGSGYQTCLAFRTFEPCKQLWCSHPDNPYFCKTKKGPP 

20 LDGTECAPGKWCFKGHCIWKSPEQTYGQDGGWSSWTKFGSCSRSCGGGVR 
SRSRSCN1SIPSPAYGGRLCLGPMFEYQVCNSEECPGTYEDFRAQQCAKRN 
YWQNAEMSWVPYEPDDDAQKCELICQSADTGDVWMNQVVHDGTRCS 
RDPYSVCARGECVPVGCDKEVGSMKADDKCGVCGGDNSHCRTVKGTLGK 
ASKQAAIJKXVQIPAGARHIQIEA^ 

25 TFTAMGLEWEDAVEDAKESLKTSGPLPEAIA^ cc 
cagcctcctgtcctgccttcttagctttcccCGGccagggccagatalagcctggcagctgtcttgtaaaggaagttggat 
tggaacacagacacactcactcacagtctcatctgcgactgattttgtgcttcggtggcagaatcgaatggttgagtatccc^ 
gagttcctcagatgccctatggaggcx:acagtagccccatgacttttctcctttatggagatattgccaactttgattt^ 
caaccttgtgGTGacagcccctcctgtgggttggacctcactctcttcctgtcttGACcttcccaacctgctgggcctgg 

30 tgggggaccagctgggcgacacagagcggaagcggcggcatgccaagccaggcagctacagcatcgaggtgctgct 
ggtggtggacgactcggtggttcgcttccatggcaaggagcatgtgcagaactatgtcctcaccctcatgaatatcgtgA 
ATgagatttaccacgatgagtccctgggggttcatataaatattgccctcgtccgcttgatcatggttggctaccgacagtc 
cctgagcctgatcgagcgcgggaacccctcacgcagcctggagcaggtgtgtcgctgggcacactcccagcagcgcca 
ggaccccagccacgctgagcaccatgaccacgttgtgttcctcacccggcaggactttgggccctcaggtatgcaaGG 

35 Gtatgcacccgtcactggcatgtgtcaccccctgaggagctgtgccctcaaccatgaggatggcttctcctcagccttcgt 
gatagctcatgagaccggccacGTGctcggcatggagcatgacggtcaggggaatggctgtgcagatgagaccagc 
ctgggcagcgtcatggcgcccctggtgcaggctgccttccaccgcttccattggtcccgctgcagcaagctggagctcag 
ccgctocctcCCCtcctacgactgcrt^^ 

atcaactactcaatggatgagcagtgccgctttgacmggcagtggctaccagacctgcttggcattcaggaccm 
40 cctgcaagcagctgtggtgcagccatcctgacaacccgUcttctgcaagaccaagaaggggcccccgctggatgggac 
tgagtgtgcacccggcaagtggtgcttraaaggtc^ 

ggctggagctcctggaccaagtttgggtcatgttcgcggtcatgtgggggcggggtgcgatccxjgcagccggagctgca 
acaaccccTCCccagcctetggaggccgcctgtgcttagggcccatgttcgagtaccaggtctgcaa^ 
gccctgggacctacgaggacttccgggcccagcagtgtgccaagcgcaactcctactatgtgcaccagaatgccaagca 
45 cagctgggtgccctacgagcctgacgatGACgcccagaagtgtgagctgatctgccagtcggcggacacggggga^ 
gtggtgttcatgaaccaggtggttcacgatgggacacgctgcagctaccgggacccatacagcgtctgtgcgcgtg 
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agtgtgtgcctgtcggctgtgacaaggaggtggggtccatgaaggcggatgacaagtgtggagtctgcgggggtgacaa 
ctcccactgcaggactgtgaaggggacgctgggcaaggcctccaagcaggcaGCTctcaagctggtgcagatccca 
gcaggtgccaggcacatccagattgaggcactggagaagtccccccaccgcattGTGgtgaagaaccaggtcaccg 
gcagcttcatcctcaaccccaagggcaaggaagccacaagccggaccttcaccgccatgggcctggagtgggaggatg 
5 cggtggaggatgccaaggaaagcctcaagaccagcgggcccctgcctgaagccattgccatcctggtgagccccactC 
TTgatacacaaaatattaaagaacccaggcacagaccagac 

ctgC22 MOOSE14071 8022899..8022973, 80388 12..8039047, 

8039484..8039515, 8040059..8040156, 8041290..8041420, 8044820..8044992, 

8054710..8054858, 8060658..8060753, 8072693..8072840, 8074894..8075031, 
10 8078272. .8078377, 8081059..8081218, 8082756..8082839, 8083142..8083427, 

8084423. .8084440, 8O856O0..8085614, 8087906..8088011, 8105355..8105431, 

8108468..8108544, 81 18599..8118640 

LLLL1XLLLFLRQSWQ 

LSLLKIQKLDRHGLIRTR]^^ 
1 5 HCHIKDFSSSLVS VSI^TVLSRYSLISLPNALIJFIVDAPKPPTEDTYLRFDEYGS 

SGRPRRSAGKSQKGLNVETLVYADKKMVEKHGKGNV 

FKZ)GTIGSDINVVVVSL]I J LEQEPGGLLINHHADQSLNSFCQW 

RHDHAILLTGFDICSWKhffiP 

AHESGHKFGM1HDGEGWCRKAEGNIMSPTLTG 
20 I^TPQAGCLVDEPKQAGQYKYPDK^ 

FDICKSLWCHRVGHRCETKFMPAA^ 

HGQWSAWSKWSECSRTCGGGVKFQERHCNNPKPQYGGLFCPGSSRIYQLC 
NINPCNENSLDFRAQQCAEYNSKPFRGWFYQWKPYT^ 

ENFEFFFAMSGKVKDGTPCSPNK^VCIDGVCELVGCDHELGSKAVSDACG 

25 VCKGDNSTCKFYKGLFKQFSCLTLLKYYPWLIPAGARSIEIQELQVSSSYLA 
VRSLSQKYYT^TGGWSIDWPGEFPFAGTTFEYQRSFN^ 
CSVAQAGGQLRDLGSLQAPPPEFT ttgttgttgttgttgttgttgttgttgtttttgagacagagtt 
ggcagggacctataatctcagcgactcaggaggctgaggcagtagaatcacttgaacccaggaggcggaggttgCAT 
tcaggagttcgagaccagcctggtcaacatggtgaaaccctgtctctactgaaaatccaaaaattagacaggcatGGTtt 

30 aataaggacacgaaaaaatgaattcctcatctcgccattacctcagcttctggcccaggaacacaactacagctcccctgc 
gggtcaccatcctcacgtaCTAttggattgctttTATcattgtcacatcaaaGATttctcttcttcccttgtttcagtttct 
ctctcaactgtcttatccagatattctctaatatcccttccaaatgctcttctgttcatcgtagatgctcccaagcctcccacaga 
ggacacctatctaaggtttgatgaatatgggagctctgggcgacccagaagatcagctggaaaatcacaaaagggcctca 
atgtggaaaccctcgtggtggcagacaagaaaatggtggaaaagcatggcaagggaaatgtcaccacatacattctcac 

35 agtaatgaacatggtttctggcctatttaaagatggga^ 

ggaacaagaacctggaggattattgatcaaccatcatgcagaccagtctctgaatagtttttgtcaatggcagtctgccctca 
ttggaaagaatggcaagagacatgatcatgccatcttactaacaggatttgatatttgttcttggaagaatgaaccatgtgac 
actctaGGGtttgcccccatcagtggaatgtgctctaagtaccgaagttgtaccatcaatgaggacacaggacttggcctt 
gccttcaccatcgctcatgagtcagggcacAACtttggtatgattcacgatggagaagggaatccctgcagaaaggctg 

40 aaggcaatatcatgtctcccacactgaccggaaacaatggagtgtmcatggtcttcctgcagccgccagtatctcaagaa 
attcctcAGCacacctcaggcggggtgtctagtggatgagcccaagcaagcaggacagtataaatatccggacaaact 
accaggacagatttetgatgctgacacacagtgtaaatggcaam^ 

Wgcaaatcacmggtgccaccgagtaggccacaggtgtgagaccaagtttatgcc^gcagcagaag 
gcttgagtatgtggtgtcggcaaggccagtgcgtaaag 
45 cgcctggtcgaagtggtcagaatgttcccggacatgtggtggaggagtcaagttccaggagagacactgcaataacccc 

AAGc^tcagtatggtggcttattctgtccaggttctagccgtatttatcagctgtgcaatatt 
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cttggattttcgggctcaacagtgtgcagaatataacagcaaacctttccgtggatggttctaccagtggaaaccctatacaa 
aagtggaaGAGgaagatcgatgcaaactgtactgcaaggctgagaacmgaatttttltttgcaatgtccggcaaagt 
aagatggaactccctgctccccaaacaaaaatgatgtttgtattgacggggtttgtgaactagtgggatgtgatcatgaacta 
ggctctaaagcagtttcagatgcttgtggcgtttgcaaaggtgataattcaacttgcaagttttataaaggcCTGttcaagc 
5 aattctcctgcctcaccctcttgAAAtattatccggtggtcctcattccagctggcgcccgaagcatcgaaatccaggag 
ctgcaggmcctccagttacctcgcagttcgaagcctcag^caaaagtattacctcaccgggggctggagcatcgactgg 
cctggggagttccccttcgctgggaccacgtttgaata.ccagcgctctttcaaccgcccggaacgtctgtacgcgccagg 
gcccacaaatgagacgctgtgttgctctgtcgcccaggctggagggcagttgcgtgatctcggctcactgcaagctccgc 
ctcctgagttcacg 

10 ctg5007 MOOSE14072 990670..990921, 996695..997045, 1032626.. 1032735, 

1036728..1036833, 1038307..1038390, 1040553..1040712, 1042267.. 1042372, 
1050714..1050851, 1059675..1059832, 1073384..1073466, 1082939..1083087, 
1085587.. 1085759, 1087530.. 1087660, 1089712..1089876 
LIXWRCPI^PAFPLIJPSRLRL^ 

1 5 I^APRSPTAPAAPAAAAATPPPAPHGASPLLTLLISEYDLVSAYEVDHRGD^ 
VSHEIMHHQRRRRAVAVSEVESLHLRLKGP 

TLGKTGTKSVQTLPPEDFC^QGSLRSHRNSSVAl^TCQGLSGMIRTEEADY 
FLRPLPSHLSWKLGRAAQGSSPSHVLNEELNVETLVVV^ 
TTYVLTILNMVSAIJ^ 
20 QWQSGLMGKDGTRHDHAILLTG 

TINEDTGLGLAFnAHESGHNPGMIHDGEGNMCK^ 

SWSPCSRQYLHKFI^TAQAICLADQPKPVKEYKYPEKLPGELYDANTQCKW 
QFGEKAKLCMLX)FKKATLWCHRI 

QCVKYGDEGPKPTHGHWSDWSSWSPCSRTCGGGVSHRSRLCTNPKPSHGG 
25 KFCEGSTRTLKLCNSQKCPRDSVDFRAAQCAEHNSRRFRGRHYKWKPYTQ 
VEDQDLCKIYXIIAEGFDFFFSLSNK^ 
VLGSDAVEDVCGVCNGNNSACTIHRGLYTKHHHTl^ 

ctcctcctctggcgctgcccgctgtctcccgccttccctctgctcccctctcgcctccgcctcagcgccccgctg 
acctcgcctcctcccctctgctctttgtccctgcactctcccctcctcggtcctctgacccccccgccctcacctcctcccctc 
30 ctctctcccctgcccgccccgcgctctcccaccgctcccgccgcccccgccgccgcggctgccactccgccccccgcg 
ccgcacggagcttcacctctcctcactcttttgatttcagaatatgacctggtctctgcctacgaggttgaccacaggggcga 
ttacgtgtcccatgaaatcatgcaccatcagcggcggagaagagcagtggccgtgtccgaggttgagtctcttcaccttcg 
gctgaaaggccccaggcacgacttccacatggatctgaggacttccagcagcctagtggctcctggctttattgtgcagac 
gttgggaaagacaggcactaagtctgtgcagactttaccgccagaggacttctgtttctatcaaggctctttgcgatcacaca 
35 gaaactcctcagtggccctttcaacctgccaaggcttgtcaggcatgatacgaacagaagaggcagattacttcctaaggc 
cacttccttcacacctctcatggaaactcggcagagrt^ 

gaacgtggagaccttggtggtggtcgacaaaaagatgatgcaaaaccatggccatgaaaatatcaccacctacgtgctca 
cgatactcaacatggtatctgctttattcaaagatggaacaataggaggaaacatcaacattgcaattgtaggtctgattcttc 
tagaagatgaacagccaggactggtgataagtcaccacgcagaccacaccttaagtagcttctgccagtggcagtctgga 

40 ttgatggggaaagatgggactcgtcatgaccacgccatc^ 

gtgacactttgGGAtttgcacccataagtggaatgtgtagtaaatatcgcagctgcacgattaatgaagata^ 
gactggccttcaccattgcc^atgagtctggacacAACtttggcatgattcatgatggagaagggaacatgtgcaaaaa 
gtccgagggcaacatcatgtcccctecattggcaggacgcaatggagtcttctcctggtcaccctgcagcc 
acacaaamctaAGCaccgctcaagctatctgccttgrt^ 

45 aattgccaggagaattatatgatgcaaacacacagtgcaagtggcagttcggagagaaagccaagctctgcatgctggac 
tttaaaaaggcaACCctgtgg^gccatcgtattggaaggaaatgtgagactaaatttatgccagcagcagaaggc^ 
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tttgtgggcatgacatgtggtgccggggaggacagtgtgtgaaatatggtgatgaaggccccaagcccacccatggcca 

ctggtcggactggtcttcttggtccccatgctccaggacctgcggagggggagtatctcataggagtcgcctctgcacc^ 

ccccAAGccatcgcatggagggaagttctgtgagggctccart^ 

ccgggacagtgttgacttccgtgctgctcagtgtgccgagcacaacagcagacgattcagagggcggcactacaagtgg 
5 aagccttacactcaagtagaaGATcaggacttatgcaaactctactgtatcgcagaaggatttgatttcttctmcm 
aataaagtcaaagatgggactccatgctcggaggatagccgtaatgtttgtatagatgggatatgtgagagagttggatgtg 
acaatgtccttggatctgatgctgttgaagacgtctgtggggtgtgtaacgggaataactcagcctgcacgattcacaggg 
gtctctacaccaagcaccaccacaccaaccgtgagtactttagagctgcctgcaagccttgggcaaagaag 
ctgChr_lctg2 MOOSE14097 9792.9938, 17966..18005, 19452..19616, 

10 19867..19947, 22217..22312, 22920..23038, 23663..23728, 25005..25139, 
25313..25402, 30489.30625, 30979.31072, 32590.32718, 33172.33270, 
33920.34003, 34386.34489, 34836.34894, 35346.35465, 36313.36446, 
36559.36624, 37252.37347, 37646.37716, 38190.38286 
MSSYRPRVISRMPHLLRLLLAVWSQ 

1 5 TTPGCVIAAAAIULQNM^ 

VLRDELEVIIJECAVLENSTAKDRPAVEKARTLYRSCMN^ 
LEWGGWPVAMDRWNEW 

QNSSRHIIYIDQPTLGMPSREYYFNGGSNTaCVREAYLQFMVSVATLL 

LPRDSCLVQEDMVQVLELETQLAKATWQEEimDYIALYHRMGLEELQSQF 
20 GIXGFNWTLFIQTVL^SVKIBCLLPDEEVVVYGff 

LVWRLVI£>WGSLSQRFKD^ 

NAVGSLYVREAFPGDSKSMVREL^ 

EKAMSIREQIGHPDYILEEMNRRL^ 

SLRKLREKVDPNL\\^ 
25 FGGIGMVIGHEITHGFDDNGRNro 

QYGNYSWDLADEQNVNGFNTLGENIADNGGWQAYKA^ 

QQLPGLDLTHEQLFFINYAQVW 

AF ADTFHC ARGTPMHPKERCRVW atgtcctcttaccgtcccagggtcatatccagaatgcca 
catcttctcaggctcctcttggctgtgacagmcte^^ 

30 gaggattactgatcaggtgagcgaggtctgcaccacccctggctgcgtgatagcaGCAgctgccaggatcctccaga 
acatggacccgaccacggaaccgtgtgacgacttctaccagtttgcatgcggaggctggctgcggcgccacgtgatccct 
gagaccaactcaagatacagcatctttgacgtcctccgcgacgagctggaggtcatcctcaaaGCGgtgctggagaatt 
cgactgccaaggaccggccggctgtggagaaggccaggacgctgtaccgctcctgcatgaaccagAGTgtgataga 
gaagcgaggctctcagcccctgctggacatcttggaggtggtgggaggctggccggtggcgatggacaggtggaacga 

35 gaccgtaGGActcgagtgggagctggagcggcagctggcgctgatgaactcacagttcaacaggcgcgtcctcatcg 
acctcttcatctggaacgacgaccagaactccagccggcacatcatctacatagacxagcccaccttgg 
cgagagtactacttcaacggcggcagcaaccggaaggtgcgggaagcctacxtgcagttcatggtgtcagtggccac^ 
tgctgcgggaggatgcaaacctgcccagggacagctgcctggtgcaggaggacatggtgcaggtgctggagctggag 
acacagctggccaaggccacggtaccccaggaggagagacacgacgtcatcgccttgtaccaccggatgggactgga 

40 ggagctgcaaagccagmggcctgaagggamaactggactctgtt^ 

tgctgccagatgaggaagtggtggtctatggcatcxcctacctgc^aaccttgaaaacatcatcg 
AGGaccatac^gaactacxtggtctggcgcctggtgctggaccgcattggtagcctaag^ 
cgagtgaactaccgcaaggcgctgmggca^ 
gcaacatggagaacgccgtgggctcxctctacgte^ 

45 tcattgacaaggtgcggacagtgmgtggagacgctggacgag 

cgcaggagaaggccatgagcatccgggagcagatcgggcaccctgac^^ 
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gacgaggagtactccaatctgaacttctcagaggacctgtactttgagaacagtctgcagaacctcaaggtgggcgccca 
gcggagcctcaggaagcttcgggaaaaggtggacccaaatCTCtggatcatcggggcggcggtggtcaatgcgttct 
actccccaaaccgaaaccagattGTAttccctgccgggatcctccagccccccttcttcagcaaggagcagccacagg 
ccttgaactttggaggcattgggatggtgatcgggcacgagatcaQgcacggctttgacgacaatGGCcggaacttcg 
5 acaagaatggcaacatgatggattggtggagtaacttctccacccagcacttccgggagcagtcagagtgcatgatctacc 
agtacggcaactactcctgggacctggcagacgaacagaacgtgaacggattcaacacccttggggaaaacattgctga 
caacggaggggtgcggcaagcctataaggcctacctcaagtggatggcagagggtggcaaggaccagcagctgcccg 
gcctggatctcacccatgagcagctcttcttcatcaacM 

ccaatccatcaagacagacgtccacagtcccctgAAGgtactggggtcgctgcagaacctggccgccttcgcagaca 
10 cgttccactgtgcccggggcacccccatgcaccccaaggagegatgccgcgtgtgg 

ctg2019 MOOSE14111 40989203..40989313, 4 1002051.. 4 1002 170, 

41003054..41003155, 41005892..41005996, 41007222..41007332, 

41008443. .41008533, 41010003..41010205, 41015854..41015938, 

41016765. . 41016931, 41076010..41076027 
1 5 MTALDRACLYWLFLFKLLVID^ 

IQRVSRKFQTFPHFPWY^ 

MYYLKISQPSGNPKKITVin^ 

KLLRNLDFYVLPVUSflDGYIYTWTTO 

ASWCSIGASRNCQDQTFCGTGPVSEPETKAVASFIESKKDDILCFLTMHSYG 
20 QLILTPYGYTKNKSSNHPEMIQVGQKAANALK^ 

SGSSRDWARDIGIPFSYTFELRDSGTYGFVLPEAQIQPTCEETMEAVL 

LQKNPY atgactgcacttgaccgtgcttgtctttattggctgtt^ 

tggtcacttttetgtaactctcgccaactcaaaacate^ 

cattcaacgagtctctaggaagttccaaactmccacatm^ 
25 gagagatcagtgagaagtacaaggaagtggtgacacagcatttcctaggagtgacctatgagacccaccccatgtattatc 

tgaagatcagccaaccatctggtaatcccaagaaaatcatttggatggactgtggaattcacgccagagaatggattgctcc 

tgcttmgccaatggttcgtcaaagaaattctacaaaaccataaagacaactcaagtatecgcaagctccto 

acttctatgtccttccagttcttaacatagatggttatatctacacttggacaactgatcgtcmggaggaaatcccgttcaccc 

cataataatggcacatgtmgggacggatctcaatcgaaam^ 
30 ccaagatcaaacattctgtgggacagggccagtgtctgaaccagagactaaagctgttgccagcttcatagagagcaaga 

aggatgatattttgtgcttcctgaccatgcactc^ 

gtaaccacccagaaatgattcaagttggacagaaggcagcaaatgcattgaaagcaaagtatggaaccaattatagagtt 
ggatcgagtgcagaUtmaTATgcctcatcagggtcttcaagagattgggcccgagacattgggattcccttctcatat 
acgtttgagctgagggacagtggaacatatgggtttgttctgccagaagctcagatcxagcccacctgtgaggagaccat 

3 5 ggaggctgtgctgtcagtcctggatgatctacaaaaaaacccatat 

ctg2019 MOOSE14139 2792089..2792135, 2796149..2796193, 

2827337..2827397, 2829453. .2829602, 283 1346..283 1494, 2832006..2832048, 
2836649..2836812, 2843539..2843636, 2844957..2845004, 2845870..2845948, 
2846070..2846139 PNTQNEIMPLCLELGrR5YHSGFCQDCFRRNEDISHSI^ 

40 PAAVSSAHPVPKHIKKPDYVTTGIW 

VLLLAGKSLKVDMTTEETOALVHREnSHNAYPSPLGYGGFPKSVCTS 
CHGIPDSRPI^DGDriNEDVTVYYNGYHGDTSETFLVGNVDECG 
RCRDEAIAACRAGAPFSVIGNTISHITHQNGFQVCT 
WHHANDSDIPMEEGMAFraPIITEGSPEFKVL 

45 TVLITSRGAQILTKLPH cctaacacacaaaaccatatgcctctgtgtctagagttaggcatcAGGagt 
tatcactctgggtmgccaagattgctttagaaggaatGAAgatatttcacacagtatagmtgccgg^ 
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. gctcatccggttcctaagcacataaagaagccagactatgtgacgacaggcattgtaccagactggggagacagcatag 
aagttaagaatgaagatcagattcaagggcttcatcaggcttgtcagctggcccgccacgtcctcctcttggctgggaaga 
gtttaaaggttgacatgacaactgaagagatagatgctcttgtt^ 
taggctatggaggtmccaaaatctgmgtacctctgtaaacaa 

5 caggatggagatatotcaacattgatgtcacagtctatt 

caatgtggacgaatgtggtaaaaagttagtggaggttgccaggaggtgtagagatgaagcaattgcagcttgcagagcag 

gggctcccttctctgtaattggaaacacaatcAGCcacataactcatcagaatggttttcaagtctgtccacattt^ 
acatggaataggatcttactttcatggacatccagaaatttggcatcatGCAaacgacagtgatctacccatggaggagg 
gcatggcattcactataGAGccaatcatcacggagggatcccctgaatttaaagtcctggaggatgcatggactgtggtc 
10 tccctagacaatcaaAGGtcggcgcagttcgagcacacggttctgatcacgtcgaggggcgcgcagatcctgaccaa 

actaccccat 

ctg_2 MOOSE14190 27561 13..2756274,-2759580..2759716, 2761344..2761618, 
2764429.-2764565 RIVGGMEASPGEFPWQASLRENKEHFCGAAIINARWLV 
SAAHCI^FQDPTKWAYVGATYI^GSEASTVRAQWQIVKHPLYNADTA 

1 5 DFDVAVLELTSPLPFGRfflQPVCLPAATHIFPPSKKCLISGWGYLKSDFLVKPE 
VLQKATVELLDQALCASLYGHSLTDRMVCAGYLDGKVDSCQGDSGGPLVC 
EEPSGRFFLAGIVSWGIGCAEARRPGVYARVTRL^ aggatcgtg 
ggcggcatggaagcatccccgggggagtttccgtggcaagccagccttcgagagaacaaggagcacttctgtggggcc 
gccatcatcaacgccaggtggctggtgtctgctgctcactgcttcaatGAGttccaagacccgacgaagtgggtggcct 

20 acgtgggtgcgacctacctcagcggctcggaggccagcaccgtgcgggcccaggtggtccagatcgtcaagcacccc 
ctgtacaacgcggacacggccgactttgacgtggctgtgctggagctgaccagccctctgcctttcggccggcacatcca 
gcccgtgtgcctcccggctgccacacacatcttcccacccagcaagaagtgcctgatctcaggctggggctacctcaagg 
aggacttcCTGgtcaagccagaggtgctgcagaaagccactgtggagctgctggaccaggcactgtgtgccagcttgt 
acggccattcactcactgacaggatggtgtgcgctggctacctggacgggaaggtggactcctgccagggtgactcagg 

25 aggacccctggtctgcgaggagccctctggccggttctttctggctggcatcgtgagctggggaatcgggtgtgcggaag 
cccggcgtccaggggtctatgcccgagtcaccaggctacgtgactggatcctggaggccaccaccaaagcc 
ctg22fin4 " MOOSE14210 16664895..16664909, 16676917.. 16677063, 
16677651. .16677787, 16679898.. 16680169, 16681309.. 16681451 
RIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVrrAAHCFQEDSM^ 

30 TVLWTWLGKVWQNSRWPGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDH 
PVVRSAAVRPVCLPARSHFFEPGLHCWITGWGALREGGP 
PQDLCSEVYRYQVTPRMLCAGYMCGKKDACQGDSGGPLVCKAI^GRWFI^ 
GLVSWGLGCGRPNYFGVYTRTTGVISWIQQTMAQS cgcattgttggtggagctgtgtcc 
tccgagggtgagtggccatggcaggccagcctccaggttcggggtcgacacatctgtgggggggccctcatcgctgac 

35 cgctgggtgataacagctgcccactgcttccaggaggacAGCatggcctccacggtgctgtggaccgtgttcctgggc 
aaggtgtggcagaactcgcgctggcctggagaggtgtccttcaaggtgagccgcctgctcctgcacccgtaccacgaag 
aggacagccatgactacgacgtggcgctgctgcagctcgaccacccggtggtgcgctcggccgccgtgcgccccgtct 
gcctgcccgcgcgctcccacttcttcgagcccggcctgcactgctggattacgggctggggcgccttgcgcgagggcG 
GCcccatcagcaacgctctgcagaaagtggatgtgcagttgatcccacaggacctgtgcagcgaggtctatcgctacca 

40 ggtgacgccacgcatgctgtgtgcxggctaccgcaagggcaagaaggatgcctgtcagggtgactcaggtggtccgct 
ggtgtgcaaggcactcagtggcxgctggttcctggcggggctggtcagctggggcctgggctgtggcc^^ 
ttcggcgtctacacccgcatcacaggtgtgatcagctggatccagcaaaccatggcccagagc 
ctg__2 MOOSE14212 2804144..2804283, 2805028..2805293, 2805383..2805519, 
2805953. .28061 10, 2806154..2806157 
45 RWGGSAAGRGEWPWQVSLWLRRREHRCGAVLVAE 

DPKQWAAFLGTPFI^GAEGQLERVARIYKHPFYNLYTI^ 
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RRSRLVRPICLPEPAPRPPDGTRCVITGWGSW 

QTCRIIPYPVQISSRMLCAGFPQGGVDSCSGDAGGPLACI^PSGRWVLTGVT 
SWGYGCGRPHFPGVYTRVAAVRGWIGQfflQDN aggattgtgggcggcagcgcag 
cgggccgtggggagtggccgtggcaggtgagcctgtggctgcggcgccgggaacaccgttgcggggccgtgctggtg 
5 gcagagaggtggctgctgtcggcggcgcactgcttcgacGTCtacggggaccccaagcagtgggcggccttcctag 
gcacgccgttcctgagcggcgcggaggggcagctggagcgcgtggcgcgcatctacaagcacccgttctacaatctcta 
cacgctcgactacgacgtggcgctgctggagctggcggggccggtgcgtcgcagccgcctggtgcgtcccatctgcct 
gcccgagcccgcgccgcgacccccggacggcacgcgctgcgtcatcaccggctggggctcggtgcgcgaaggaGG 
Ctccatggcgcggcagctgcagaaggcggccgtgcgcctcctcagcgagcagacctgccgccgcttctacccagtgca 

10 gatcagcagccgcatgctgtgtgccggcttcccgcagggtggcgtggacagctgctcgggtgacgctgggggacccct 
ggcctgcagggagccctctggacggtgggtgctaactggggtcactagctggggctatggctgtggccggccccacttc 
ccaggtgtctatacccgggtggcagctgtgagaggctggataggacagcacatccagGACaac 
ctgCHR3_ll MOOSE14225 15721072..15721214, 15722598.. 15722866, 
15724464..15724600, 15726636..15726800 

1 5 RnGGTDTLEGGWPWQVSLHFVGSAYCGASVISREWLLS AAHCFHGNRLSDP 

TPWTAHLGMYVQGNAKFVSPV 

LKQLIQPICIPPTGQRWSGEKCWTGWGRRHEADNKGSLVLQQAEVELIDQ 
TLCVSTYGETSRMLCAGIMSGKRDACKGDSGGPLSCRI^SDGKWILTGIVS 
WGHGSGRPNFPGVYTRVSNFVPWIHKYVPSL cgcatcatcggaggcacagacaccctgga 

20 ggggggttggccgtggcaggtcagcctccactttgttggatctgcctactgtggtgcctcagtcatctccagggagtggctt 
ctttctgcagcccactgttttcatggaaacAGGctgtcagatcccacaccatggactgcacacctcgggatgtatgttcag 
gggaatgccaagmgtctccccggtgagaagaattgtggtccacgagtactataacagtcagacttttgattatgatattgct 
ttgctacagctcagtattgcctggcctgagaccctgaaacagctcattcagccaatatgcattcctcccactggtcagagag 
ttcgcagtggggagaagtgctgggtaactggctgggggcgaagacacgaagcaGATaataaaggctccctcgttctg 

25 cagcaagcggaggtagagctcattgatcaaacgctctgtgtttccacctacgggatcatcacttctcggatgctctgtgcag 
gcataatgtcaggcaagagagatgcctgcaaaggagattcgggtggacctttatcttgtcgaagaaaaagtgatggaaaat 
ggatmgactggcattgttagctggggacatggaagtggacgaccaaacmcctggtgtttacacaagggtgtcaaacttt 
gttccctggattcataaatatgtcccttctctt 

ctg4012 MOOSE14231 9641523..9641665, 9645343..9645602, 

30 9650813..9650955, 9656232..9656386, 9656688..9656691 

RIVQGRETAMEGEWPWQASLQLIGSGHQCGASLISNTWLLTAAHCFWE^K 

DPTQWIATFGATITPPAVK^^ 

VQRVCIPDSSKLPPKTSVFVTGFGSIVDDGPIQNTLRQARVETK 
DVYDGIJrTPGMLCAGFME 
3 5 C ALPKKP GV YTRVTKYRD WIAS KTGMN agaattgtccaaggaagggaaacagctatg 
gaaggggaatggccatggcaggccagcctccagctcatagggtcaggccatcagtgtggagccagcctcatcagtaac 
acatggctgctcacagcagctcactgctmggAAAaataaaga 

acaccaccxgcagtgaaacgaaatgtgaggaaaattattcttcatgagaattaccatagagaaacaaatgaaaatgacatt 

gcmggttcagctctctactggagttgagtmcaaatatagtccagagagmgcctcccagactcatc^ 
40 ctaaaacaagtgtgttcgtcacaggatttggatccattgtagatgatGGAcctatacaaaatacacttcggc^ 

gtggaaaccataagcactgatgtgtgtaacagaaaggatgtgtatgatggcctgataactccaggaatgttat^ 

tcatggaaggaaaaatagatgcatgtaagggagattetggtgga 

ggtatagtaagttggggacaatcatgtgcacttcccaaaaaa^ 

attgcctcaaagactggtATGaac 
45 ctg2019 MOOSE14237 67883 182..678833 13, 67883686.-67883772, 

67883860..67884019,.67884218..67884360, 67884574..67884741 
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RIVGGS AAPPGAWP WLVRLQLGGQPLCGGVLV AAS WVLTAAHCFLLWTVT 
LAEGSRGEQAEEWVNWLPHPKFDPRTFHNDLALVQLWTPV 
LPQEPQEPPAGTACAIAGWGALFEDGPEAEAVREARVPLLSTDTCRRALGPG 
LRPSTMLCAGYLAGGVDSCQGDSGGPLTCSEPGPRPREVLFGVTSWGDGCG 
5 EPGKPGVYTRVAVFKDWLQEQMSGE cgcatcgtggggggcagcgcggcgccgcccggggc 
ctggccctggctggtgaggctgcagctcggcgggcagcctctgtgcggcggcgtcctggtagcggcctcctgggtgctc 
acggcagcgcactgctttcttctgtggactgtgacgctggcagaggggtcccggggggagcaagcggaggaggtgcca 
gtgaaccgcatcctgccccaccccaagtttgacccgcggaccttccacaacgacctggccctggtgcagctgtggacgc 
cggtgagcccggggggatcggcgcgccccgtgtgcctgccccaggagccccaggagccccctgccggaaccgcctg 

10 cgccatcgcgggctggggcgccctcttcgaaGACgggcctgaggctgaagcagtgagagaggcccgtgttcccctg 
ctcagcaccgacacctgccgaagagccctggggcccgggctgcgccccagcaccatgctctgcgccgggtacctggc 
ggggggcgttgactcgtgccagggtgactcgggaggccccctgacctgttctgagcctggcccccgccctagagaggt 
cctgttcggagtcacctcctggggggacggctgcggggagccagggaagcccggggtctacacccgcgtggcagtgtt 
caaggactggctccaggagcagatgagcggtgag 

15 ctg4012 MOOSE14238 9538614..9538772, 9541857..9541999, 

9546242..9546501, 9550024..9550160 
RIASGVIAPKAAWPWQASLQYDNIHQCGATLISNTW 

QWWSFGTKWPLMKim^ 

QICLPEASASFQPNLTVmTGFGALYYGGESQNDLREARVKnSDDVCKQPQV 

20 YGNDIKPGMFCAGYMEGIYD 

CGQKDKPGVYTQVTYYRNWIASKTGI agaatagcatctggagtcattgcacccaaggcggcct 
ggccttggcaagcttcccttcagtatgataacatccatcagtgtggggccaccttgattagtaacacatggcttgtcactgca 
gcacactgcttccagAAGtataaaaatccacatcaatggactgttagtmggaacaaaaatcaaccctcccttaatgaa^ 
agaaatgtcagaagatttattatccatgagaagtaccgctctgcagcaagagagtacgacattgctgttgtgcaggtctcttc 

25 cagagtcaccttttcggatgacatacgccagatttgtttgccagaagcctctgcatccttccaaccaaatttgactgtccacat 
cacaggatttggagcactttactatggtGGGgaatcccaaaatgatctccgagaagccagagtgaaaatcataagtgat 
gatgtctgcaagcaaccacaggtgtatggcaatgatataaaacctggaatgttctgtgccggatatatggaaggaatttatg 
atgcctgcaggggtgattctgggggacctttagtcacaagggatctgaaagatacgtggtatctcattggaattgtaagctg 
gggagataactgtggtcaaaaggacaagcctggagtctacacacaagtgacttattaccgaaactggattgcttcaaaaac 

30 aggcatc 

ctgCHR12_3 MOOSE14241 16346585..16346749, 16348623..16348765, 
16374722..16374999, 16389747..16389901 

RnGGTEAQAGAWPWVVSLQIKYGRVLVHVCGGTLVRERWVLTAAHCTKD 

ASDPLMWTAVIGTNNfflGRYPHTKKIKIKA^ 

35 VRYNDYIQPICLPFDWQILDG 

HYISREMCNSERSYGGIIPNTSFCAGDEDGAFDTCRGDSGGPIMCYLPEYKR 

FFVMGITSYGHGCGRRGFPGVYIGPSFYQKWLTEHFFHA cggattatagggggca 
ccgaagcacaagctggcgcatggccgtgggtggtgagcctgcagattaaatatggccgtgttcttgttcatgtatgtgggg 
gaaccctagtgagagagaggtgggtcctcacagctgcccactgcactaaagacgctAGCgatcctttaatgtggacag 

40 ctgtgattggaactoataatatacatggacgctatcctcate^ 
tcatmggaatcttatgtaaatgatattgcac^^ 

acctmgatgtmccaaatcctggacggaaacacaaagtgttttataagtggctggg^ 
atGGTaacgctacaaaUttttacaagatgcagaagtgcattatamctcgagagatgtgtaattctgag^^ 

gggaataattcctaacacttcatmgtgcaggtg^ 
45 taatgtgctacttaccagaatataaaagatttmgtotgggaattaccagttacggacatggctgtggtcgaagaggtttt^ 

tggtgtctatattgggccatccttctacx^aaaagtggctgacagagcatttcttccatgca 
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ctgCHR12_2 MOOSE14247 26347557.-26347748, 26349485.. 26349636, 
26350962..26351057, 26355150..26355322, 26356046..26356143, 
26356421 ..26356468 RISSWRNSTVTGHPWQVSLKSDEHHFCGGSLIQEDRW 
TAAHCLDSLSEKQLKMTVTSGEYSLFQKDKQEQNIPVSKnTHPEYNSREYM 
5 SPDIALLYLKHKVKFGNAVQPICLPDSDDKVEPGILCLSSGWGKISKTSEYSN 
VLQEMELPDvn)DRACNWLKSMhtt.PPLGRTMLCAGFPDWG]VlI)ACQGDSG 
GPLVCRRGGGIW1IAGITSWVAGCAGGSWVRNNHVKASLGIFSKVSELMDF 

ITQNLFTG (SEQIDNO: 79) 

agaattagtagttggagaaattcaacagtgactggacatccatggcaggtctccctaaaatcagatgagcaccacttctgtg 
10 gaggaagcttgattcaagaagatcgggttgttacagcagcacactgcctggacagcctcagtGAGaagcagctgaag 
aatataactgtgacttctggggagtacagcctctttcagaaggataagcaagaacagaatattcctgtctcaaaaattattac 
ccatcctgaatacaacagccgtgaatatatgagtcctgatattgcactgctgtatctaaaacacaaagtcaagtttGGAaat 
gctgttcagccaatctgtcttcctgacagcgatgataaagttgaaccaggaattctttgcttatccagtggatggggcaagat 
ttccaaaACAtcagaatattcaaatgtcctacaagaaatggaacttcccatcatggatgacagagcgtgtaatactgtgct 
15 caagagcatgaacctccctcccctgggaaggaccatgctgtgtgctggcttccctgattggggaatggacgcctgccagg 
gggactctggaggaccactggtttgtagaagaggtggtggaatctggattcttgctgggataacttcctgggtagctggttg 
tgctggaggttcagttc«cgtaagaaacaaccatgtgaaggcatcacttggcattttctccaaagtgtctgagttgatggattt 
tatcactcaaaacctgttcacaggt (SEQ ID NO: 32) 

ctg4012 MOOSE14251 10140958..10140985, 10163010..10163036, 

20 10189245..10189369, 10190482..10190601, 10191060..10191348, 
10192103..10192190, 10195165..10195210 

EIWSGEQGQNDMVWLSSLKMSGQHYCGASLISERHLVTAAHCFKVTKNPK 

NYWSFGTKVTLPYMQHDVQQHIHEDYIQDEHHDDIALILLTKKVLF^ 

RVCLPEATQIFPPGEGVVVTGWGPJ^FNGOSENLTYHKASVKITDTNTCNA 

25 KEAYRSMVQDRVLCAGYMEGNIDACQGDSGGPLVHPNSLNIWYIWYLVGV 
VSWGRNECGAINSPGVYTQTDVFFFLKW1KSTIALK (SEQ ID NO: 80) 
gagatttggtcaggggaacaggggcagaatgatatggtttggctcTCAagccttaaaatgagtgggcaacactactgtg 
gggcatcattgatcagtgaaagacacttggtgactgcagctcactgttttaaaGTGacaaaaaatccaaaaaactatact 
gtcagctttggcacgaaagtaactcttccctatatgcaacatgatgttcaacaaattattattcatgaagactacatccaggat 

30 gaacatcatgatgatatcgcacttatactgctcactaaaaaagttttatttaagaatgatgtacatcgagtttgtcttcctgaagc 
cacacagatttttccawtggtgaaggagttgttgttacaggatggggaagactttcatttaatggtaagatcagtgaaaactt 
aacataccataaagcatctgtgaagattactgatacaaacacttgtaatgctaaagaagcctatcgtagtatggtacaggata 
gagtgctatgtgctgggtacatggaaggaaatatagacgcGtgccagggagactctggaggaccactagttcatcctaatt 

ctctaaatatttggtatatttggtaccttgttggag^ 

35 gtctacACAcagacagatgtcttttttttWaAAGtggatcaaaagcacaattgctctcaaa (SEQ ID NO: 33) 
ctg4015 MOOSE14254 14157594..14157630, 14167829..14167892, 

14178504.. 141 78769, 14179473..14179642, 14187478..14187633 
RWSMESKKGKVQWLVVLFGSSSIQGSRKDKAKTWTTFSYTVWLGSrrVGD 
SPJGIVKYYVSKIVIHPKYQDTTADVALLKLSSQVTFTSAILPICLPSVTKQLAI 

40 PPFCWVTGWGKVKESSDRDYHSALQEAEVPnDRQACEQLYNPIGIFLPALEP 
VIKEDKICAGDTQNMKDSCKGDSGGPLSCHIDGVWIQTGVVSWGLECGKSL 

PGVYTNVIYYQKWINATISRA (SEQ ID NO: 81) 

aggatagtcagcatggaatctaagaagggaaaagtcCAAtggctagtggtcctgtttggcagctcttccattcagggaa 
gcaggaaagamggccataAAGawtggactacttWcatatactgtgtggctaggatcgattacagtaggtgactca 
45 aggaaacgtgtgaagtactacgtgtccaaaatcgtcatccatcccaagtaccaagatacaacggcagacgtcgccttgttg 
aaactgtcctctcaagtc^ttcacttctgccatcctgwtamgcttgcccagtgtcacaaagcagttggcaattcc 
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tttgttgggtgaccggatggggaaaagttaaggaaagttcaGATagagattaccattctgcccttcaggaagcagaagt 
acccattattgaccgccaggcttgtgaacagctctacaatcccatcggtatcttcttgccagcactggagccagtcatcaag 
gaagacaagatttgtgctggtgatactcaaaacatgaaggatagttgcaagggtgattctggagggcctctgtcgtgtcaca 
ttgatggtgtatggatccagacaggagtagtaagctggggattagaatgtggtaaatctcttcctggagtctacaccaatgt 
5 aatctactaccaaaaatggattaatgccactatttcaagagcc (SEQ ID NO: 34) 

ctgchr7_ctgl8037 MOOSE14260 3060956..3061 102, 3061204.3061343, 
3063795.3064042,3064266.3064399 
LAFNPDYTVS STPP YLVYIXSD YIJPC AGVL^ 
TIP ADSNEKHLQVIGYEKMfflOT 
1 0 LPYQTISENTMCSVSTWSYNVYKia?^ 
MLCVGIWGRRQPCKEVSAAPAICNGML 
YIPWIENVIQNN (SEQ ID NO: 82) 

ttggccmaatccagattacacagtcagctccactcccccttacttggtctatttgaaatctgactacttgccctgcgctggag 
tcctgatccacccgctttgggtgatcacagctgcacactgcaatttaCCAaagcttcgggtgatattgggggttacaatcc 

15 cagcagactctaatgaaaagcatctgcaagtgattggctatgagaagatgattcatcatccacacttctcagtcacttctattg 
atcatgacatcatgctaatcaagctgaaaacagaggctgaactcaatgactatgtgaaattagccaacctgccctaccaaa 
ctatctctgaaaataccatgtgctctgtctctacctggagctacaat^gTACaaagagcccgattcactgcaaactgtga 
acatctctgtaatctccaagcctcagtgtcgcgatgcctataaaacctacaacatcacggaaaatatgctgtgtgtgggcatt 
gtgccaggaaggaggcagccctgcaaggaagtttctgctgccccggcaatctgcaatgggatgcttcaaggaatcctgtc 

20 ttttgcggatggatgtgtmgagagccgatgttggcatctat^^ 
aaaataac (SEQ ID NO: 35) 

ctg2014 MOOSE14265 21780003..21780336, 21782470..21782645, 

21786391. .21786549 RWAAGVRWAQHSEEPPHNRSTNPSDYRILLGYT)QQS 
HPTEHSKQMT VNKI3N1VHADYNELHRM SHILP ACLPE 

25 PTTWLAPDSSCWISGWGMW 

YPGQPSSSDYTIHEDMLCAGDLrTGK^ 

WSLDCCSPVGPRVFTRLPYFTNWISQKE^ES (SEQ ID NO: 83) 

aggtgggcagccggggtgagggtgccggcccagcattcagaggagcctccccacaacaggtccactaacccatctgat 

taccggatcctgcttgggtatgaccagcaaagccatcccacagagcacagcaagcagatgacagtgaataagatcatggt 

30 gcacgctgactataacgagttgcaccgcatggggagtgacatcaccctgctgcagctgcaccatcatgtggaattcagctc 
ccacatcctcbccgcctgccttccggaaccaaccacgtggctggcccctgacagctcctgctggatatctggtt 
ggtcaccgaggatGTCttcctgcctgagcccttccaacttcaggaggcagaggtcggtgtcatggacaacactgtctgc 
ggatcctttttccagccccagtaccccggccagccaagcagcagtgactacaccatccacgaggacatgctgtgcgctgg 
ggacctcataacaggaaaggccatttgccgacgagactccaggggtcccctcgtctgcccattaaatggcacctggttcct 

35 gatggggctgtctagttggagcctcgactgctgctcacccgtcggtcccagggtcttcaccaggctcccctacttc 
ctggatcagccagaagaagagggagagc (SEQ ID NO: 36) 
ctgC20 MOOSE14272 1992244.. 1992402, 19931 17..1993202, 

1996233..1996286, 2017918..2017988, 2050580..2050652, 2054829.. 2054897, 
2058461. .2058508, 2059496..2059523, 2064130..2064150 

40 RWS GYPS ANMVSTP WRTGILHFNHCIHDLSQTVLGDHL 
DHAVALLFLQIS SIWNGNIYPIPLP AFVS YK^ 

PLCARVDRQQGEQCEHTEFGYQPETIKNDMLCAGFEEGKKDACX 
VCLVGQSWLQAGVISWGEGCARQNRPGVYIRW (SEQ 
ID NO: 84) 
45 agagttgjttctggatacttttcagcaaacatggm^ 

gatctgagccaaA.CAgtcctgggggatcatttagtteaattccatcatactataaagatta 
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CTgtggccctmgtttttgcaaatttcttccam^ 

agaatgctagtamgtaggatcatgttgtggggacatgctggggacatgctmccccATGaactttcccttgtgtgcccg 
cgtggacagacaacagggggagcagtgcgagCACaccgagtttggctaccaacccgaaaccatcaagaatgacatg 
ctgtgcgccggcttcgaggagggcaagaaggatgcctgcaagggcgactcgggcggccccctggtgtgcctcgtgggt 
5 cagtcgtggctgcaggcgggggtgatcagctggggtgagggctgtgcccgccagaaccgcccaggtgtctacatccgt 
gtcaccgcccaccacaactggatccatcggatcatccccaaactg (SEQ ID NO: 37) 
ctgChr_lctgl09 MOOSE14278 4832..4921, 5738..S839, 6532..6615, 
6661. .6758, 10595.. 10623, 15756.. 15898, 16240.. 16398 
HENKjKRQIAFPRRPGTREGCPLL^ 

1 0 KTffKATVIKTDGHY^ 

GRVAHAC^STLGGRGGWITRWGSHYVAQAGETSDELQEMQLPLILEPWC 
HLLYGHMSYIMPDMLCAGDrLNAKTVCEGDSGGPLV 
GRGCSNPLYPGVYASVSYFSKWICDNIEIT (SEQ ID NO: 85) 
catataatcaatggtaaaagacagatagctttcccccg^ 

15 caatgcacactgcactccgccatgggcaacagagcaagactccaactcaaaaaaaaaaaaaaaaaaagagacagaga 
aaacaattccaaaagctacagttatcaaaacagatggccactataaagaaaacaaaaacagaaaacatcaagtgttggca 
aagatgtggagaaattggaacctttatgcactgttggttttctgcaagattaaacatagaattactgagccaggcagggtggc 
tcacgcctgtaatcccagcactttgggaggccgaggcgggtggatcacgAGAtgggggtctcactatgttgcccaggc 
tGGTgagacctcagacgagctgcaggagatgcagctcccgctgatcctggagccctggtgccacctgctctacggac 

20 acatgtcctacatcatgcccgacatgctgtgtgctggggacatcctgaatgctaagaccgtgtgtgagggcgactccggg 
ggcccacttgtctgtgaattcaaccgcagctggttgcagattggaattgtgagctggggccgaggctgctccaaccctctgt 
accctggagtgtatgccagtgtttcctatttctcaaaatggatatgtgataacatagaaatcacg (SEQ ID NO: 38) 
ctgChr_6ctg20 MOOSE14279 3772814..3772839, 3773615..3773663, 

3774882.3774902, 3776288.3776466, 3786503. .3786609, 3803444..3803462, 

25 381 1761. .381 1774, 3825365.3825494, 3826636.3826668, 3842185.3842229, 

3846239.3846299 RVSGGRDSVPSLVPSTNAYNRKRPENPHMCGGFLASNI 
EHLLCARHRIQKSMTSAHRSKVRRI^ 

NYDISLLGLASPAVITDKWACLPSPNYWADQTECYITDWGETQGTFGAGF 
LKEAQLPVIENEVCNRYEFLNGRVKSTELCAGHLAGGIDSCKVRKDQETKVS 

30 LFGIGCGDWVRSPHFYTYIHTYTPSIQENIKEN (SEQ ID NO: 86) 

agggtctctggaggtagggacagtgtcccatctttggtaccatccaccaatgcctacaacAGGraagaggcctgagaac 
cctcacatgtgtggaggtttcctggccTCAaacattgagcacctgctgtgtgctaggcacAGGattcaaaaatccatg 
acgtctgctcataggtcaaaggttaggagacttgaatctcattggtacaaagggaaaagaaagacaaggagtaaagagaa 
aaggaaaatatttggaaaatacaccagcaacATAaattacgacataAGTctgctgggtttggccAGTcctgccgtc 

35 atcactgacaaagtaatcccagcttgtctgccatccccaaattatgtggtcgccgaccagactgaatgttacatcactgactg 
gggagaaacccaaGGTacctttggggctggctttctcaaggaagcccagctccctgtgattgagaatgaagtgtgcaat 
cgctatgagtttctgaatggaagagtcaaatccactgagctctgtgctgggcamggctggaggcattgacagttgcaagg 
taagaaaagatcaagagaccaaagttagtcttmggtate^ 

tacacacatacacaccgTCCattcaagaaaatatcaaagaaaat (SEQ ID NO: 39) 
40 ctgl lctgl5 MOOSE14295 6743906..6743990, 6796305..6796757, 
6824377..6824594 RKLGILNHQVLFWYNLSlXIJtlFlGYKSYSEP 
DMDPRPSRSYQVANGIAVLPVSGTLVSKTRAL^^ 
SDPGVDGILLDMDTPGGMVSGAFDCADIIARM^ 

LIASSASRRLVTQTARTGSIGVMMAHSNYGAALKTNGGH CSTIHN 
45 SKDIJa>TQMPINNRLDKENVAHIHHG 
KLTQEQK (SEQ ID NO: 87) 
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agaaagctaggcatcttaaaccaccaggtactattttggtataatctatcacttctgttacattttattggatataaatcata 
Cgaaccgctggcgctgtttggtgaggatgatgacatggatccccgtccatcacgcagctatcaggtggcaaatggtatcg 
cggtcttgccggtttccggcacgctggtcagtaaaacccgtgcgcttcagccttattccgggatgacgggttacaacggga 
tcattgctcgcctgcagcaggcaatcagtgaccccggcgttgacggcattcttctggatatggatacgccgggtggaatgg 
5 tgtccggggcgtttgactgcgccgacattattgcccgtatgcgcgatatcaaacccatctgggcgctggccaatgacatga 
actgcagtgcaggtcagcttattgccagttcggcatcgcgacggctggtcacacaaacggccagaaccggctccattggg 
gtcatgatggcgcacagtaactatggcgctgcgctcaaaactaacggcGGGcacatgcacacatatgtttattgcagca 
ctattcacaatagcaaagacttaaaaccaacccaaatgcccatcaataatagactggataaagaaaatgtggcacatatac 
accatggaatactgtgcagtcataaaaaggatgagttcatgtcctttgcaggaacgtggatgaagctagaaaccatcattct 
10 cagcaaactaacacaggaacagaaa (SEQ ID NO: 40) 

ctgC31 MOOSE14296 605248..605324, 613146..613186, 625648. .625692, 

631870..631946, 63S586..639242 

MLGVLLQIWRGSWKKQTQAQGRRERSRQAAGAVSAGGRRALL^ 
DKLACVDSRLRLVMRGLVLGRASGSSVRPKLPKDVRADFQTRTO 
1 5 EKVSAYTGMSVQDVLDTEAAVFSGQESLDNG 

DRRKKTTLGGTMPSPSASAVTTKPVDQAATQTTASAEQATTVDTTIASV^ 
PVDVSAQVTAAVAAENSRIMGrLNCDEAKGRESQARALAETPGMTVESAQR 
ILAAAPQSAQMRTDTALDRLMETAPGALQAGSASSDAADDLLNTPV (SEQ 
ID NO: 88) 

20 atgctgggggtgcttcttcaaatctggagggggagctggaagaagcagacacaggcccagggccggagggagagaA . 
GCcgccaggcagccggggctgtgagtgcxggaggccgacgcGCActgttattgtatcttagagctgaactggaaga 
caaactggccTGTgtggacagcaggcttagactggtgatgagggggcttgtcctggggagggcctcaggcagctctg 
ttagacccaaacttccgaaagatgtgcgtgctgatttccagacgcgtatcgatgccactcgtcagatgtttgccgaaaaggtt 
tccgcttateccggcatgtctgttcaggacgtgctggacaccgaagcggcagtattctccggccaggaatctttggataac 

25 gggctggcggatgaacttgttaacaataccgatgcgctcggcgtgatgcgcgaagcactcgacagacgcaaaaaaacaa 
cccttggaggaactatgccatcaccttctgcatcagctgtgaccactaagccagttgaccaggcagcaactcagacaactg 
catcagctgaacaggccactaccgttgaGacgacaattgcttccgtagcagcccctgtagatgtcagtgcgcaggttactg 
cagcagtagctgcagagaatagtcgcatcatgggcatcctgaactgcgacgaggctaaagggcgtgagtcacaggcgc 
gagcactggccgaaacgccgggtatgacggtagagagcgcacagcgcattctggctgctgcaccgcaaagtgcccag 

30 atgcgtaccgatacggcgctggatcgtttgatggaaacagcacccggtgcactccaagcaggtagcgcatcttctgatgc 
cgctgacgatttgttaaacacccccgtt (SEQ ID NO: 41) 

ctg_2 MOOSE14301 1416405. .1416576, 1416840..1417019, 1417206.. 1417323, 
1417394.. 1417497, 1417583. .1417683, 1420409.. 1420621, 1420697.. 1420869, 
1421 159.. 1421247, 1421341. .1421417, 1421520.. 1421648, 1421744.. 1421776 
35 TDPWFSKQWYMNSEAQPDLSILQAWSQ 

ANYDPLASYDFNDYDPDPQPRYTPSKENRHGTRCAGEVAAMAN^ 
VAFNAItfGGVRMIJ)GTITD 

GILTREAFRRGVTKGRGGLGTLFWASGNGGLHYDNCNCDGYTNS 

GSTTQQGRWWSEACASTLTTTYSSGVATDPQIVTTDLHHGCTO 
40 AS APLAAGMIALAl^ANPFLTWRDMQHLVVR 

QVSHHYGYGLLDAGLLVDTARTWLPTQPQRK^ 

VSACAGLHNSIRSIJiHVQAQLTLSYSRRGDL^ 

STEGYNNWVFMSTBffWDENPQGVWI^ 

LGSPT (SEQ ID NO: 89) 
45 acggacxcrtggttctccaagcagtggtac^^ 

aggggctgtcaggccagggcaicgtggtctctgtg^^ 
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actacgaccccctggccagctatgacttcaatgactacgacccggacccccagccccgctacacccccagcaaagaga 
acCGGcacgggacccgctgtgctggggaggtggccgcgatggccaacaatggcttctgtggtgtgggggtcgctttc 
aacgcccgaatcggaGGCgtacggatgctggacggtaccatcaccgatgtcatcgaggcccagtcgctgagcctgca 
gccgcagcacatccacatttacagcgccagctggggtcccgaggacgacggccgcacggtggacggccccggcatcc 
5 tcacccgcgaggccttccggcgtggtgtgaccaagggccgcggcgggctgggcacgctcttcatctgggcctcgggca 
acggcggcctgcactacgacaactgcaactgcgacggctacaccaacagcatccacacgctttccgtgggcagcacca 
cccagcagggccgcgtgccctggtacagcgaagcctgcgcctccaccctcaccaccacctacagcagcggcgtggcc 
accgacccccagatcgtcaccacggacctgcatcacgggtgcacagaccagcacacgggcacctcggcctcagcccc 
actggcggccggcatgatcgccctagcgctggaggccAACccgttcctgacgtggagagacatgcagcacctggtg 

10 gtccgcgcgtccaagccggcgcacctgcaggccgaggactggaggaccaacggcgtggggcgccaaGTGagcca 
tcactacggatacgggctgctggacgccgggctgctggtggacaccgcccgcacctggctgcccacccagccgcaga 
ggaagtgcgccgtccgggtccagagccgccccACCcccatcctgccgctgatctacatcagggaaaacgtatcggcc 
tgcgccggcctccacaactccatccgctcgctggagcacgtgcaggcgcagctgacgctgtcctacagccggcgcgga 
gacctggagatctcgctcaccagccccatgggcacgcgctccacactcgtggccataCGAcccttggacgtcagcact 

15 gaaggctacaacaactgggtcttcatgtccacccacttctgggatgagaacccacagggcgtgtggaccctgggcctaga 
gaacaagggctactatttcaacacgggtgagggcggggcggggctgtggtgggcggggcttggctctccaacc 
(SEQ ID NO: 42) 

ctgl8ctg2 MOOSE14331 595090..595301, 599634..599677, 601386..601450, 

601725..601849, 609022..609148, 619640..619741 
20 MASRYDRMTVFSPDGHLFQVEYAQ 

VAKLQDERTVRiaCALDDHVCM 

DPVTVEYITRFIATLKQIOT^ 

AGYKINnK^QKFAAFLYANNEQSEKEIKX 

NENYKTLMQETEADTKK (SEQ ID NO: 90) 
25 atggcgtctcgalatgacagggcgatcactgtcttctccccagacggacacctttttcaagttgaatatgcccaggaagcgg 

tgaagaaaggatccaccgcggtcggaattcgaggtaccaatatagttgttcttggggtagaaaaaaaatctgttgccaagct 

tcaagatgaaagaactgtgaggaaaatttgtgcccttgatgaccatgtctgcatggcttttgcaGGActtactgctgatgct 

agagtagtaataaacagagcccgtgtggagtgccagagccataagcttacggttgaggacccagtcactgtagaatacat 

aactcgcttcatagcaactttaaagcagattaatocaaagagttamga 
30 ttTTTagctgggattaccggcacatgccaccacacctggctaacmTTTgcaggatacaaaatcaacaaacaaaaatt 

tgcagcatttctatatgccaacaatgaacaatctgaaaaagaaatcaagaaagtaatcccatttatgatagctacaaataaaa 

ttaaatgcatagaaataaacttaaccaaagaagtgaaagatttccacaatgaaaactataaaacactgatgcaagaaactga 

agcagacaccaaaaaa (SEQ ID NO: 43) 

ctgl8ctg2 MOOSE14335 803151..803182, 814164..814218, 819027..8 19098, 
35 831305..831344, 838218..838243, 839545. .839577, 840006..840128, 
860714..860833, 860952..861014, 887374..887493 
SKGGISVGLCVRDGVVWSRDTNSPHRVTPLLNELM 
AAJFISLRRS AEINKYVIYPRD VCTT YIVNRM 

FDDDGISRLYQTDPS GT YHAWKANAIGRS AKTVKEFIJEKNYTED A1ASDSE A 
40 IKLAIKALLEWQSGGKNIELAinaW 

EEEDEEEEEEVEEEEEE (SEQ ID NO: 91) 

tcaaaaggaggaamcagtgggtctctgtGTTcgggatggggtggtggtggtaagtagagatactaacagccctcaca 
gagttactcctctgctaaatgaactaatgtgtctcaggtgttctgggctggcagcagctgcgaagatg^ 
tctctgaggagatcagcagagataaataagtatgttATAto^^ 
45 tccttgataaaaataaaatatacccaaagcaatggacgaagaccttttggtatttctgccttaatt 

gt^tctcaagattgtatcagacagatccttctggtacttatcatgcttggaaggcaaatgcaataggccgaagtgctaaaact 
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gttcgagaatttctagaaaagaattacacagaagatgccatagcaagtgacagtgaagctatcaagttagcaataaaagctt 
tgctagaagttgtccagtctggtggaaaaaacattgaacttgctataataagaagaaatcaacctttgaagaagaaagagga 
ggaggaggaaagaaggaagaagaaagaagaagaaggaggagaagaagaagaggaggaggaggaagaggatgag 
gaggaagaggaggaggtggaagaggaagaggaagaa (SEQ ID NO: 44) 
5 ctgChr_9ctgl926 MOOSE14337 2164211. .21 64258, 21 64700.. 2 167526, 

2169222..2169361 ENGSLTWQELLRQTGKCSIPCLIDTGAQANETEETVRA 
HKLPTRPWSKSVIYGGVYPNK^ 
FTTLYDNNIEISSSKHTLSQMNKVSNIV^ 
PIKGLEFEVELTQENYRLPIRNYPI^PGK^ 
1 0 CP VMF WKKEGTLRMVVDYKPL^ 
LKSAYHLIRVRKGDEHKLAFRCPRGW 
VKESHWCYMDNILmSKSESEHVKIWK^ 
VKJ^IGYHISEKGFTPC^ 

QLTHPLNNLLKKDWWKWTPTQTQAIENIKQCLV 

15 DASDVAVGAVl^QKHDDDKYYPVGYYSAKMSKAQLNYSVSDK^ 
LKHWimYLESTffiPFKILTDHRmiGRITNESE^ 
YRPGSANHIADALSRIVDETEPIPKDSEDNSr^ 
DTKLLNLLNNEDKRVEEMQL^ 
KUHPGffiLLTNIILR^ 

20 PSERPWESLSMDFTTALPESSGYNALF^^ 
MFDQRVIAYFGNPKEIIADNDHIFT 
QTERTNQTVEKLIJEICVCSTHPNTWVDHISLVQQSY^ 
RYSPALSPLELPSFSDKTDENSQETIQVFQTVl^ 
IEEFQPGDLVMVKJR.TKTAFLYTNNRQTESQIMSELPFTIASKW 

25 VKDLFKENYK (SEQ ID NO: 92) 

gagaatggaagtcttacatggcaagaattactcagacaaacagggaaatgctcaataccctgtctaatcgatacgggcgct 
caagcaaatattataacagaagaaactgttcgagcacataaactgcctaccagaccctggtcaaaaagtgtgatatatggtg 
gagtttatccaaataagattaatcgcaaaacaataaaacttaacataagtctaaatggaatatcaatcaaaacagaattcttgg 
ttgtaaagaaattttcgcatccagctgctatctccttcacaacattatatgacaataacattgaaatatctagcagtaaacacac 

30 gctctctcaaatgaacaaagtttcaaatattgtcaaggaacctgagttaccagatatctataaagaattcaaagacattactgc 
agaaaccaatacggaaaagctaccaaagccaataaaagggttagaatttgaagttgaactaactcaagaaaactacagatt 
acctatcagaaattacccgctaccaccgggaaaaatgcaagcta.tgaatgatgaaatteatcaaggattaaaaagtggaatt 
atacgagaatctaaagccattaacgcctgtccagteatgttcgttccgaaaaaggaaggcaccttgagaatggtggttgact 
acaaacctttaaataagtatgtcaaacccaatatatatccgttaccacttattgaacaattacttgctaaaatacaaggt 

35 aatttttactaaacttgacctcaaaagtgcctatcacttgatacgagtaagaaaaggagatgaacataaacttgctttt 
cctcgtggagtttttgaatatctagtaatgrcttotggc^^ 

ggtgaagtcaaagaatcgcatgtagtatgttatatggataatattttaattcattcaaaatcggaatctgaacatgtaaaacatg 
ttaaagacgttctacagaaattaaaaaatgcgaacttaattatcaatcaagcaaaatgtgaatttcatcaatcacaagtaaaat 
ttatogggtatcaratttcggaaaaaggatttacgccttgte^ 

40 aatcgtaaagaattacgacaatttctaggttctgtcaattatctt^ 

tcaataatcttttgaaaaaggatgtacgctggaaatggacaccaacacaaacccaagcgatagaaaacattaaacaatgttt 
agmcgcctccggtgctecgacactttgatttcagtaaaaagattctactagaaactgatgcttcaga 
cgtattgtctcaaaaacatgatgatgataaatactatcctgttggatactattcagcaaagatgtctaaagcacaattaaattat 
agcgtatcggacaaagaaatgcttgcaatcattaagtctctcaaacattggagac^^ 

45 aaaattttaacagacxatcgaaacttaattggtcgcatte^ 

attatttttacaagacttcaacmgaaattaactacagacctggatcagcaaatcacatagctgat^^ 
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acgaaacagaaccaattccaaaagattcagaagacaatagtatcaactttgttaatcaaatctcgataaccgatgattttaaa 
aaccaagtggttacagaatatacgaatgatacaaaattgttgaatttactaaacaatgaagacaaacgagtggaagagaata 
tccaactcaaagatggcttactaattaacagtaaagaccaaatcttattacctaatgatactcagctgactaggacaattatta 
aaaagtatcatgaagaaggtaaattgattcatccaggcattgaacttcttacaaacattatattacgtagatttacgtggaaag 
5 gaataagaaaacaaatacaagaatatgtacagaactgccatacatgtcaaataaacaaatctaggaatcataaaccttatgg 
acctttacaaccaattcccccatcagaaagaccttgggaatctttatcaatggattttattacagcttta 
ataatgcacttttcgtggtagttgaccgattttcaaaaatggcaatcttagtaccttgtacgaaatccattacagcagagc^ 
cagctcgaatgtttgatcaacgagttattgcttamcggcaatccaaaagaaatcattgcagataatgatcatatttttacttcc 
caaacgtggaaagatttcgcacataaatataatttcgttatgaaattttcgttaccatacagaccacaaactgatggacaaact 

10 gagcgtacaaaccaaactgtggagaaattactaagatgtgtatgtagcacacatccaaatacatgggtagatcatatatccc 
tagtgcaacaatcttacaacaatgcgatacattcagcaactcaaatgacaccttttgagatagtacatcgctattcaccagctt 
tatcacctttagagttecctagctttagtgacaaaactgacgaaaactctcaggaaacgatccaagtatttcaaacagttaaa 
gaacacttgaatacaaacaacataaagatgaaaaagtatttcgatatgaaaatacaagaaattgaagaatttcaacctggag 
acctagttatggtcaaaagaacgaaaacaGCAttcttatacaccaataacagacaaacagagagccaaatcatgagtga 

15 actcccattcacaattgcttcaaagagaataaaatacctaggaattcaactgacaagggaagtgaaggacctcttcaagga 
gaactacaaa 

ctgchrl4_l MOOSE14342 3984761. .3985345 

GPRLAHGTTTLAFRFRH 

SGTSADCATWYRVLQRELRIJ^ 

20 CVATALCGWDRSGPEIJFYWSDGTRLQGDrFSVGSGSPYAYGVLDRGYRW 
MSTQEAYALARCAVAHATHRDAYSGGSVDmiVRESGWEHVS ggccccag 
actggcccacggcaccaccactctggccttccgcttccgtcatggagtcattgctgcagctgacacgcgttcctcctgtgg 
cagctatgtggcgtgtccagcctcatgcaaggtcatccctgtgcaccagcacctcctgggtaccacctctggcacctctgc 
cgactgtgctacctggtatcgggtattacagcgggagctgcggcttcgggaactgagggagggtcagctgcccagtgtg 

25 gccagtgctgccaagctcttgtcagccatgatgtctcaataccggggactggatctctgtgtggccactgccctctgcggct 
gggaccgctctggccctgagctcttctacgtctatagcgacggcacccgcctgcagggggacatcttctctgtgggctctg 
gatctccctatgcctacggcgtgctagaccgtggctatcgctacgacatgagcacccaggaagcctacgccctggctcgc 
tgcgccgtggcccacgccacccaccgtgatgcctattcagggggctctgtagaccttttccacgtgcgggagagtggatg 
ggagcatgtgtca 

30 ctg2019 MOOSE14348 50831031..50831624 

SIMSYNGGAIMAMKGKNRVAIA^ 
LAGLATDVQRVAQCLKFQLNLYEI^ 
YTDPVIAGLDIXTFKPFSCSIJDLIGFPMVT 

EPNMDPEHPFETISP AMLN AVD WG AGS GMG VHHITKKX^KITTRT tctattatgtc 
35 ctataacggaggagccatcatggccatgaaggggaagaaccgtgtggccatcgctgcagacaggcacttcgggatcca 
ggcccagatggtgaccacggactt(xaggagatcmcccatgggtggttggttg^catcggtctggccgggcttgccac 
tgacgtccagagagttgcccagtgcctcaagttccagctgaacctatatgagttgaaggaaggtcagcagatcaaacctta 
taccttcacgagcatggtggccaacttcttgtatgagaaaca 

cctgaagacctttaagcccttcagttgctctctagacxitcatcggcttccccatggtgactgatgacm 
40 ctetgccgaacaaatgtacggaatgtgtgagtcccte^ 

cccagccatgctgaatgctgtggactggggtgcagggtcaggcatgggagtcatcatccacatcaccaagaaggacaaa 

atcaccaccaggaca 
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MOOSE13873_ 
MOOSE13874 
MOOSE13895 
5 MOOSE13908 
MOOSE13930. 
MOOSE13940 
MOOSE13941 
MOOSE13943 

10 MOOSE13948 
MOOSE13952_ 
MOOSE13954 
MOOSE13975 
MOOSE13977 

15 MOOSE13980. 
MOOSE13982 
MOOSE14044 
MOOSE14065_ 
MOOSE14068 

20 MOOSE14071" 
MOOSE14072 
MOOSE14097 
MOOSE14111 
MOOSE14139 

25 MOOSE14190 
MOOSE14210_ 
MOOSE14212 
MOOSE14225 
MOOSE14231 

30 MOOSE14237 
MOOSE14238 
MOOSE14241 
MOOSE14247 
MOOSE14251 

35 MOOSE14254 
MOOSE14260 
MOOSE14265 
MOOSE14272 
MOOSE14278 

40 MOOSE14279 
MOOSE14295 
MOOSE14296 
MOOSE14301 
MOOSE14331 

45 MOOSE14335 



ctgchrllq_l 
ctgchrllq_l 
_ctgl0008 
ctgChr_13ctg3 
_ctgl5ctg27 
ctgChr_Xctg264 
ctgChr_Xctg264 
_ctgChr_Xctg26 
_ctg8ctg6 
ctgChr_lctg34 
ctg8ctg4 
ctg21fin2 
_ctgchrllq_J> 
ctgl7005 
_ctgCHR12_ll 
_ctg2013 
_ctgCHR12_3 
_ctgl0007 
ctgC22 
_ctg5007 
_ctgChr_lctg2 
_ctg2019 
_ctg2019 
_ctg_2 
_ctg22fin4 
~<*g_2 

_ctgCHR3_ll 
_ctg4012 
_ctg2019 
_ctg4012 
_ctgCHR12_3 
_ctgCHR12__2 
_ctg4012 . 
ctg4015 

^ctgchr7_ctgl8037 
_ctg2014 
ctgC20 

__ctgChr_lctgl09 
_ctgChr_6ctg20 
_ctgl lctgl5 
_ctgC31 
_ctg_2 
ctgl8ctg2 
_ctgl8ctg2 



Al Pepsin 
Al Pepsin 
CI Papain 

CI 2 Ubiquitin C-terminal hydrolase family 1 

CI 5 Pyroglutamyl-peptidase I 

C19 Ubiquitin C-tercninal hydrolase family 2 

CI 9 Ubiquitin C-terminal hydrolase family 2 

C19 Ubiquitin C-terminal hydrolase family 2 

CI 9 Ubiquitin C-terminal hydrolase family 2 

C19 Ubiquitin C-terminal hydrolase family 2 

CI 9 Ubiquitin C-terminal hydrolase family 2 

C19 Ubiquitin C-terminal hydrolase family 2 

C19 Ubiquitin C-terminal hydrolase family 2 

C19 Ubiquitin C-terminal hydrolase family 2 

CI 9 Ubiquitin C-terminal hydrolase family 2 

M12A Astacin 

M12B Reprolysin 

M12B Reprolysin 

M12B Reprolysin 

M12B Reprolysin 

Ml 3 Neprilysin 

Ml 4 Carboxipeptidase Clan 

M24A Methionyl aminopeptidase, type 1 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin/ trypsin 

SI Chymotrypsin / trypsin 

S 1 Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

SI Chymotrypsin / trypsin 

S 1 Chymotrypsin / trypsin 

S49 Endopeptidase IV (sppA) (E.coli) 

S49 Endopeptidase IV (sppA) (E.coli) 

S8 Subtilases 

Tl A Threonine Type Peptidases 
T1A Threonine Type Peptidases 
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MOOSE14342_ctgchrl4_l 

MOOSE14348_ctg2019 
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U22 Drosophila transposon 297 endopeptidase 
TIB Threonine Type Peptidases 
TIB Threonine Type Peptidases 
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Appendix III 



10 



PARKINSON'S DISEASE 



Locus 1 
7 



Locus 3 
51 



Marker:DlS231 



Lod:5.11 



CM RANGE of one LOD drop: 



MOOSE13952 C19 Ubiquitin C-terminal hydrolase family 2 

DISTANCE: 12.78 Mb 

Marker:DlS2842 Lod:l.26 CM RANGE of one LOD drop: 

MOOSE14278 SI Chymotrypsin / trypsin 

DISTANCE: -16.6 Mb 



15 



HYPERTENSION 



Locus 4 
17 



MarkenD 1 1 S4 1 02 Lod: 1 .5 



CM RANGE of one LOD drop: 



MOOSE14295 S49 Endopeptidase IV (sppA) (E.coli) 

DISTANCE: -9.83 Mb 



20 



25 



ANXIETY 

Locus 1 
14 



Marker:D9S1690 Lod:4.38 CM RANGE of one LOD drop: 

MOOSE14337 U22 Drosophila transposon 297 endopeptidase 

DISTANCE: -9.03 Mb 



COPD (CHRONIC OBSTRUCTIVE PULMONARY DISEASE) 



Locus 4 
20 



Marker:D19S884 Lod:2.9 



CM RANGE of one LOD drop: 



MOOSE14301 S8 Subtilases 

DISTANCE: -8.63 Mb 

MOOSE14190 SI Chymotrypsin / trypsin 

DISTANCE: -7.29 Mb 

MOOSE14212 SI Chymotrypsin / trypsin 

DISTANCE: -7.24 Mb 



30 Locus 5 
12 



Marker:D21S1884 Lod:3.7 



CM RANGE of one LOD drop: 



MOOSE13975 C19 Ubiquitin C-terminal hydrolase family 2 

DISTANCE: -5.37 Mb 
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ASTHMA 

Locus 2 Marker:D3S1546 Lod:3.4 CM RANGE of one LOD drop: 

17 

MOOSE14225 SI Chymotrypsin / trypsin 

5 DISTANCE: 8.894 Mb 

NTDDM (NON-INSULIN DEP. DIABETES) 

Locus 3 Marker:D12S79 Lod:3.6 CM RANGE of one LOD drop: 

12 

MOOSE1 3982 C19 Ubiquitin C-terminal hydrolase family 2 

10 DISTANCE: -6.67 Mb 

OBESITY 

Locus 3 Marker:D12S79 Lod:3.9 CM RANGE of one LOD drop: 

10 

MOOSE13982 C19 Ubiquitin C-terminal hydrolase family 2 

15 DISTANCE: -6.67 Mb 

Locus 4 Marker:D14S283 Lod:3.2 CM RANGE of one LOD drop: 

12 

MOOSE14342 TIB Threonine Type Peptidases 

DISTANCE: 0.824 Mb 

20 MIGRAINE (GENOME-WIDE SCAN ONLY) 

Locus 1 Marker:D2S347 Lod:2.8 CM RANGE of one LOD drop: 

45 

MOOSE 1 4265 SI Chymotrypsin / trypsin 

DISTANCE: 7.808 Mb 

25 Locus 2 Marker:D2S2321 Lod:2.0 CM RANGE of one LOD drop: 

25 

MOOSE141 1 1 M14 Carboxipeptidase Clan 

DISTANCE: -0.46 Mb 

MOOSE14348 TIB Threonine Type Peptidases 

30 DISTANCE: 9.374 Mb 

BIPOLAR (GENOME-WIDE SCAN ONLY) 

Locus 1 Marker:DlS434 Lod:3.3 CM RANGE of one LOD drop: 

25 

MOOSE14097 Ml 3 Neprilysin 

35 DISTANCE: -12.7 Mb 
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OP (OSTEOPOROSIS) 



Locus 2 
11 



Marker:DllS4102 Lod:2.95 



CM RANGE of one LOD drop: 



MOOSE14295 S49 Endopeptidase IV (sppA) (E.coli) 

DISTANCE: -9.83 Mb 



10 



15 



20 



MYOPIA 

Locus 1 
30 



Marker:D2S2215 



Lod:1.34 



CM RANGE of one LOD drop: 



MOOSE14265 SI Chymotrypsin / trypsin 

DISTANCE: 1.613 Mb 



ALZHEIMER'S DISEASE 



Locus 3 
4 



Marker:D13S789 



Lod:4.12 



CM RANGE of one LOD drop: 



MOOSE13908 C12 Ubiquitin C-terminal hydrolase family 1 

DISTANCE: -2.63 Mb 



PAOD (PERIPHERAL ARTERIAL OCCLUSIVE DISEASE) 



Locus 1 
3 



Locus2 
8 



Marker:D 1S2798 Lod:4.32 CM RANGE of one LOD drop: 

MOOSE13952 C19 Ubiquitin C-terminal hydrolase family 2 

DISTANCE: -8.16 Mb 

Marker:DlS2846 Lod:2.42 CM RANGE of one LOD drop: 

MOOSE 1 3952 C 1 9 Ubiquitin C-terminal hydrolase family 2 

DISTANCE: 1.451 Mb 



25 AMD (AGE-RELATED MACULAR DEGENERATION) 



Locus3 
14 



MarkerD 16S3057 Lod:2.84 CM RANGE of one LOD drop: 

MOOSE14272 SI Chymotrypsin/ trypsin 

DISTANCE: 7.521 Mb 



30 PROSTATE CANCER 



Locus2 
38 



Marker:D21S1916 Lod:1.21 



CM RANGE of one LOD drop: 



MOOSE1 3975 C19 Ubiquitin C-terminal hydrolase family 2 

DISTANCE: -11.7 Mb 
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MI (MYOCARDIAL INFARCTION) 

Locus2 Marker:D7S2513 Lod:1.75 CM RANGE of one LOD drop: 

22 

MOOSE14260 S 1 Chymotrypsin / trypsin 

5 DISTANCE: 0.603 Mb 

OSTEOARTHRITIS 

Locus2 Marker:D4S2999 Lod:3.8 CM RANGE of one LOD drop: 

11 

MOOSE14254 SI Chymotrypsin / trypsin 

10 DISTANCE: -2.48 Mb 

Locus4 Marker:D16S3255 Lod:3 CM RANGE of one LOD drop: 

9 

MOOSE14272 SI Chymotrypsin/ trypsin 

DISTANCE: 10.31 Mb 

1 5 ED-INFLAMMATORY BOWEL DISEASE 

Locus4 Marker:D16S3040 Lod:2.2 CM RANGE of one LOD drop: 

16 

MOOSE14071 M12B Reprolysin 

DISTANCE: -2.31 Mb 
20 MOOSE14296 S49 Endopeptidase IV (sppA) (E.coli) 

DISTANCE: 12.52 Mb 

LONGEVITY 

Locus2 Marker:DllS987 Lod:2.07 CM RANGE of one LOD drop: 

12 

25 MOOSE13874 Al Pepsin 

DISTANCE: -9.92 Mb 
MOOSE13873 Al Pepsin 

DISTANCE: -9.81 Mb 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid molecule comprising a protease gene, wherein the 
protease gene has a nucleotide sequence selected from the group of nucleic 

5 acid sequences as shown in Appendix I, or the complements of the group of 

nucleic acid sequences as shown in Appendix L 

2. A nucleic acid encoding a polypeptide, wherein the polypeptide has an amino 
acid sequence selected from the group consisting of the group of amino acid 
sequences as shown in Appendix L 

10 3\ An isolated nucleic acid molecule which hybridizes under high stringency 
conditions to a nucleotide sequence selected from the group of nucleic acid 
sequences as shown in Appendix I, or the complements of the group of 
nucleic acid sequences as shown in Appendix L 



4. An isolated nucleic molecule which hybridizes under high stringency 
15 conditions to a nucleotide sequence encoding an amino acid sequence 

selected from the group consisting of the group of amino acid sequences as 
shown in Appendix I. 

5. A method for assaying for the presence of a first nucleic acid molecule in a 
sample, comprising contacting said sample with a second nucleic acid 

20 molecule, where the second nucleic acid molecule comprises a nucleotide 

sequence selected from the group of nucleic acid sequences as shown in 
Appendix I, and hybridizes to the first nucleic acid under high stringency 
conditions. 
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6. A vector comprising an isolated nucleic acid molecule selected from the 
group consisting of: 

(a) the nucleic acid sequences as shown in Appendix I; 

(b) the complement of one of the nucleic acid sequences are shown in 
5 Appendix I; or 

(c) a nucleic acid encoding an amino acid molecule as shown in 
Appendix I; 

where the nucleic acid molecule is operably linked to a regulatory sequence. 

7. A recombinant host cell comprising the vector of Claim 6. 

10 8. A method for producing a polypeptide encoded by an isolated nucleic acid 
molecule, comprising culturing the recombinant host cell of Claim 7 under 
conditions suitable for expression of the nucleic acid molecule. 

9. An isolated polypeptide encoded by the nucleotide sequence of the group of 
nucleic acid sequences as shown in Appendix I, or the complements thereof. 

15 10. The isolated polypeptide of Claim 9, wherein the polypeptide has an amino 
acid sequence selected from the group consisting of the group of amino acid 
sequences as shown in Appendix I. 

1 1 . An isolated polypeptide comprising an amino acid sequence, wherein the 
amino acid sequence is greater than about 95% identical to an amino acid 

20 sequence selected from the group consisting of the group of amino acid 

sequences as shown in Appendix I. 

12. A fusion protein comprising an isolated polypeptide of Claim 2. 



13. 



A fusion protein comprising an isolated polypeptide of Claim 11. 
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14. An antibody, or an antigen-binding fragment thereof, which selectively binds 
to a polypeptide of Claim 2, or to a fragment or variant of said amino acid 
sequence. 

15. An antibody, or an antigen-binding fragment thereof, which selectively binds 
5 to a polypeptide of Claim 1 1, or to a fragment or variant of said amino acid 

sequence. 

16. A method of assaying for the presence of a polypeptide encoded by an 
isolated nucleic acid molecule according to Claim 1 in a sample, the method 
comprising contacting the sample with an antibody which specifically binds 

10 to the encoded polypeptide. 

17. A method of identifying an agent which alters the activity of a protease, the 
method comprising: 

(a) contacting a polypeptide of Claim 9, or a derivative or 
fragment thereof, with an agent to be tested; 
15 (b) assessing the level of activity of the polypeptide or derivative or 

fragment thereof; and 
(c) comparing the level of activity with a level of activity of the 

polypeptide or active derivative or fragment thereof in the absence of 
the agent; 

20 wherein if the level of activity of the polypeptide or derivative or fragment 

thereof in the presence of the agent differs, by an amount that is statistically 
significant, from the level in the absence of the agent, then the agent is an 
agent that alters activity of a protease. 



18. 

25 



An agent which alters the activity of a protease, identifiable according to the 
method of Claim 17. 
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The agent of Claim 18, where the agent is selected from the group consisting 
of: a protease gene binding agent; a receptor; a peptidomimetic; a fusion 
protein; a prodrug; an antibody; and a ribozyme. 

A method of altering activity of a polypeptide encoded by a protease gene, 
comprising contacting the polypeptide with an agent of Claim 19. 

A method of identifying an agent which alters interaction of the polypeptide 
of Claim 9 with a protease gene binding agent, comprising: 

a) contacting the polypeptide or a derivative or fragment thereof, 
and the binding agent, with an agent to be tested; 

b) assessing the interaction of the polypeptide or derivative or fragment 
thereof with the binding agent; and 

c) comparing the level of interaction with a level of interaction of the 
polypeptide or derivative or fragment thereof with the binding agent 
in the absence of the agent, 

wherein if the level of interaction of the polypeptide or derivative or 
fragment thereof in the presence of the agent differs by an amount that is 
statistically significant, from the level of interaction in the absence of the 
agent, then the agent is an agent that alters interaction of the polypeptide with 
the binding agent. 

An agent which alters interaction of a protease gene polypeptide with a 
protease gene binding agent, identifiable according to the method of Claim 
21. 

An agent which alters interaction of a protease gene polypeptide with a 
protease gene binding agent, selected from the group consisting of: a second 
protease gene binding agent; a receptor; a peptidomimetic; a fusion protein; a 
prodrug; an antibody; and a ribozyme. 
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24. A method of altering interaction of a protease gene polypeptide with a 
protease gene binding agent, comprising contacting the protease gene 
polypeptide and/or the protease gene binding agent with an agent of Claim 
23. 

5 25. A method of identifying an agent which alters expression of a protease gene, 
comprising the steps of: 

, a) contacting a solution containing a nucleic acid comprising the 
promoter region of the protease gene operably linked to a 
reporter gene with an agent to be tested; 
10 b) assessing the level of expression of the reporter gene; and 

c) comparing the level of expression with a level of expression of the 

reporter gene in the absence of the agent, 
wherein if the level of expression of the reporter gene in the presence of the 
agent differs, by an amount that is statistically significant, from the level of 
1 5 expression in the absence of the agent, then the agent is an agent that alters 

expression of the protease gene. 

26. An agent which alters expression of the protease gene, identifiable according 
to the method of Claim 25. 

27. A method of identifying an agent which alters expression of a protease gene, 
20 comprising the steps of: 

a) contacting a solution containing a nucleic acid of Claim 1 or a 
derivative or fragment thereof with an agent to be tested; 

b) assessing expression of the nucleic acid, derivative or fragment; and 

c) comparing expression with expression of the nucleic acid, derivative 
25 or fragment in the absence of the agent, 

wherein if expression of the nucleotide, derivative or fragment in the 
presence of the agent differs, by an amount that is statistically significant, 
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from the expression in the absence of the agent, then the agent is an agent 
that alters expression of the protease gene. 

The method of Claim 27, wherein the expression of the nucleotide, derivative 
or fragment in the presence of the agent comprises expression of one or more 
splicing variant(s) that differ in kind or in quantity from the expression of 
one or more splicing variant(s) the absence of the agent. 

An agent which alters expression of a protease gene, identifiable according to 
the method of Claim 27. 

An agent which alters expression of a protease gene, selected from the group 
consisting of: antisense nucleic acid to a protease gene; a protease gene 
polypeptide; a protease gene receptor; a protease gene binding agent; a 
peptidomimetic; a fusion protein; a prodrug thereof; an antibody; and a 
ribozyme. 

A method of altering expression of a protease gene, comprising contacting a 
cell containing a protease gene with an agent of Claim 30. 

A method of identifying a polypeptide which interacts with a protease gene 
polypeptide, comprising employing a two yeast hybrid system using a first 
vector which comprises a nucleic acid encoding a DNA binding domain and 
a protease gene polypeptide, splicing variant, or a fragment or derivative 
thereof, and a second vector which comprises a nucleic acid encoding a 
transcription activation domain and a nucleic acid encoding a test 
polypeptide, wherein if transcriptional activation occurs in the two yeast 
hybrid system, the test polypeptide is a polypeptide which interacts with a 
protease polypeptide. 
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33. A protease gene therapeutic agent selected from the group consisting of: a 
protease gene or fragment or derivative thereof; a polypeptide encoded by a 
protease gene; a receptor; a protease gene binding agent; a peptidomimetic; a 
fusion protein; a prodrug; an antibody; an agent that alters protease gene 

5 expression; an agent that alters activity of a polypeptide encoded by a 

protease gene; an agent that alters posttranscriptional processing of a 
polypeptide encoded by a protease gene; an agent that alters interaction of a 
protease gene with a protease gene binding agent; an agent that alters 
transcription of splicing variants encoded by a protease gene; and a 
10 ribozyme. 

34. A pharmaceutical composition comprising a protease gene therapeutic agent 
of Claim 33. 



35. The pharmaceutical composition of Claim 34, wherein the protease gene 
therapeutic agent is an isolated nucleic acid molecule comprising a protease 

15 gene or fragment or derivative thereof. 

36. The pharmaceutical composition of Claim 34, wherein the protease gene 
therapeutic agent is a polypeptide encoded by the protease gene. 

37. A method of treating a disease or condition associated with a protease in an 
individual, comprising administering a protease gene therapeutic agent to the 

20 individual, in a therapeutically effective amount. 

38. The method of Claim 37, wherein the protease gene therapeutic agent is a 
protease gene agonist. 



39. 



The method of Claim 38 wherein the protease gene therapeutic agent is a 
protease gene antagonist. 
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40. A transgenic animal comprising a nucleic acid selected from the group 
consisting of: an exogenous protease gene and a nucleic acid encoding a 
protease gene polypeptide. 

41 . A method for assaying a sample for the presence of a protease gene nucleic 
5 acid, comprising: 

a) contacting said sample with a nucleic acid comprising a contiguous 
nucleotide sequence which is at least partially complementary to a 
part of the sequence of said protease gene nucleic acid under 
conditions appropriate for hybridization, and 

10 b) assessing whether hybridization has occurred between a protease gene 

nucleic acid and said nucleic acid comprising a contiguous nucleotide 
sequence which is at least partially complementary to a part of the 
sequence of said protease gene nucleic acid; 
where if hybridization has occurred, a protease gene is present in the nucleic 

15 acid. 

42. The method of Claim 41, wherein said nucleic acid comprising a contiguous 
nucleotide sequence is completely complementary to a part of the sequence 
of said protease gene nucleic acid. 

43. The method of Claim 41 , comprising amplification of at least part of said 
20 protease gene nucleic acid. 

44. The method of Claim 4 1 , wherein said contiguous nucleotide sequence is 
100 or fewer nucleotides in length and is either: a) at least 80% identical to a 
contiguous sequence of nucleotides in one of the nucleic acid sequences as 
shown in Appendix I; b) at least 80% identical to the complement of a • 

25 contiguous sequence of nucleotides in one of the nucleic acid sequences as 

shown in Appendix I; or c) capable of selectively hybridizing to said protease 
gene nucleic acid. 
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45 . A reagent for assaying a sample for the presence of a protease gene nucleic 
acid, said reagent comprising a nucleic acid comprising a contiguous 
nucleotide sequence which is at least partially complementary to a part of the 
nucleotide sequence of said protease gene nucleic acid. 

5 46. The reagent of Claim 45 , wherein the nucleic acid comprises a contiguous 
nucleotide sequence which is completely complementary to a part of the 
nucleotide sequence of said protease gene nucleic acid. 

47. A reagent kit for assaying a sample for the presence of a protease gene 
nucleic acid, comprising in separate containers: 

10 a) one or more labeled nucleic acids comprising a contiguous nucleotide 

sequence which is at least partially complementary to a part of the 
nucleotide sequence of said protease gene nucleic acid, and 
b) reagents for detection of said label. 

48. The reagent kit of Claim 47, wherein the labeled nucleic acid comprises a 
1 5 contiguous nucleotide sequences which is completely complementary to a 

part of the nucleotide sequence of said protease gene nucleic acid. 

49. A reagent kit for assaying a sample for the presence of a protease gene 
nucleic acid, comprising one or more nucleic acids comprising a contiguous 
nucleotide sequence which is at least partially complementary to a part of the 

20 nucleotide sequence of said protease gene nucleic acid, and which is capable 

of acting as a primer for said protease gene nucleic acid when maintained 
under conditions for primer extension. 

50. The use of a nucleic acid which is 100 or fewer nucleotides in length and 
which is either: a) at least 80% identical to a contiguous sequence of 

25 nucleotides in one of the nucleic acid sequences as shown in Appendix I; b) 

at least 80% identical to the complement of a contiguous sequence of 
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nucleotides in one of the nucleic acid sequences as shown in Appendix I; or 
c) capable of selectively hybridizing to said protease gene nucleic acid, for 
assaying a sample for the presence of a protease gene nucleic acid. 

5 1 . The use of a first nucleic acid which is 1 00 or fewer nucleotides in length 
5 and which is either: 

a) at least 80% identical to a contiguous sequence of nucleotides in one 
of the nucleic acid sequences as shown in Appendix I; 

b) at least 80% identical to the complement of a contiguous sequence of 
nucleotides in one of the nucleic acid sequences as shown in 

10 Appendix I; or 

c) capable of selectively hybridizing to said protease gene nucleic acid; 
for assaying a sample for the presence of a protease gene nucleic acid that 
has at least one nucleotide difference from the first nucleic acid. 

52. The use of a nucleic acid which is 100 or fewer nucleotides in length and 
15 which is either: 

a) at least 80% identical to a contiguous sequence of nucleotides in one 
of the nucleic acid sequences as shown in Appendix I; 

b) at least 80% identical to the complement of a contiguous sequence of 
nucleotides in one of the nucleic acid sequences as shown in 

20 Appendix I; or 

c) capable of selectively hybridizing to said protease gene nucleic acid; 
for diagnosing a susceptibility to a disease or condition associated with a 
protease. 



WO 03/040393 



PCT/IB02/04615 



1/58 



SEQUENCE LISTING 



<110> deCODE genetics ehf . 

Martinez, Roger Alfonso Moraga 
Sigurdsson, Gunnar Thor 

<12 0> NUCLEIC ACIDS ENCODING PROTEASES 



<130> 2345.2035002 

<150> 60/332,633 
<151> 2001-11-06 

<160> 94 

<170> FastSBQ for Windows Version 4.0 

<210> 1 

<211> 1083 

<212> DNA 

<213> Homo sapiens 



<400> 1 

ctggtagatg 

atcggaactc 

gtgccctcag 

gattcttcca 

atgacaggca 

atcttcggcc 

atcctggggc 

atctggaacc 

aagagtggca 

aactgggtgc 

aacggagaga 

ctgctgaccg 

aactcagatg 

gtgccaggca 

gtggtcagct 

cagtaccccg 

ttccagggca 

atccgccagt 

get 



aacagcccct 
ctgcccagga 
tctactgctc 
cctaccagtc 
tecteggata 
tgagegagae 
tggcctaccc 
agggcctggt 
gcgtggtgat 
ctgttaccgt 
ccatcgcctg 
gcccaaccag 
gcgacgtgag 
gaagegaega 
gctcagccat 
tgccacccag 
tgaacgtccc 
actttaccgt 



ggagaactac 
tttcaccgtc 
cagtcttgcc 
caccagcgag 
cgacactgtc 
ggaacctggc 
cagcatttcc 
ttctcaggac 
ctttggtggc 
cgagggttac 
tgetgaggge 
ccccattgcc 
tccagccccg 
aaacccttct 
cagcagcctg 
tgcctacatc 
caccgaatct 
cttcgacagg 



ctggatatgg 
gtctttgaca 
tgcaccaacc 
acagtctcca 
caggttggag 
tccttcctgt 
tcctccgggg 
ctcttctctg 
attgactctt 
tggcagatca 
tgccaggcca 
aacatccaga 
actgccctgt 
aacttttctc 
cccgacatcg 
ctgcagagcg 
ggagagcttt 
gcaaacaacc 



agtacttegg 
ccggctcctc 
acaaccgctt 
tcacctacgg 
gcatctctga 
attatgetec 
ccacacccgt 
tctacctcag 
cttactacac 
ccgtggacag 
ttgttgacac 
gegacategg 
tctacactca 
accctcactc 
tcttcaccat 
aggggagctg 
ggatcctggg 
aggteggect 



cactatcggc 
caacctgtgg 
caaccctgag 
caccggcagc 
caccaatcag 
cttcgatggc 
ctttgacaac 
cgccgatgac 
tggaagtctg 
catcaccatg 
cggcacctct 
agecagegag 
agtagtgggt 
tttccagatg 
caafcggagtc 
catcagtggc 
tgatgtcttc 
ggcccctgtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1083 



<210> 2 

<211> 1083 

<212> DNA 

<213> Homo sapiens 



<400> 2 

ctggtagatg 

atcggaactc 

gtgccctcag 

gattcttcca 

atgacaggca 

atcttcggcc 

atcctggggc 

atctggaacc 

aagagtggca 

aactgggtgc 

aacggagaga 



aacagcccct 
ctgcccagga 
tctactgctc 
cctaccagtc 
tecteggata 
tgagegagae 
tggcctaccc 
agggcctggt 
gcgtggtgat 
ctgttaccgt 
ccatcgcctg 



ggagaactac 
tttcactgtc 
cagtcttgcc 
caccagcgag 
cgacactgtc 
ggaacctggc 
cagcatttcc 
ttctcaggac 
ctttggtggc 
cgagggttac 
tgetgaggge 



ctggatatgg 
ctctttgaca 
tgcaccaacc 
acagtctcca 
caggttggag 
tccttcctgt 
tcctccgggg 
ctcttctctg 
attgactctt 
tggcagatca 
tgccaggcca 



agtacttegg 
ccggctcctc 
acaaccgctt 
tcacctacgg 
gcatctctga 
attatgetec 
ccacacccgt 
tctacctcag 
cttactacac 
ccgtggacag 
ttgttgacac 



cactatcggc 
caacctgtgg 
caaccctgag 
caccggcagc 
caccaatcag 
cttcgatggc 
ctttgacaac 
cgccgatgac 
tggaagtctg 
catcaccatg 
cggcacctct 



60 

120 

180 

240 

3O0 

360 

420 

480 

540 

600 

660 
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ctgctgaccg 
aactcagatg 
gtgccaggca 
gtggtcagct 
cagtaccccg 
ttccagggca 
atccgccagt 
get 



gcccaaccag 
gcgacgtgag 
gaagegaega 
gctcagccat 
tgccacccag 
tgaacgtccc 
actttaccgt 



ccccattgcc 
tccagccccg 
aaacccttct 
cagcagcctg 
tgcctacatc 
caccgaatct 
cttcgacagg 



aacatccaga 
actgccctgt 
aacttttctc 
cccgacatcg 
ctgcagagcg 
ggagagcttt 
gcaaacaacc 



gegacategg 
tctacactca 
accctcactc 
tcttcaccat 
aggggagctg 
ggatcctggg 
aggteggect 



agecagegag 
agtagtgggt 
tttccagatg 
caatggagtc 
catcagtggc 
tgatgtcttc 
ggcccctgtg 



720 

780 

840 

900 

960 

1020 

1080 

1083 



<210> 3 
<211> 915 
<212> DNA 

<213> Homo sapiens 



<400> 3 

gacaaggcat 

tgtctgatat 

tacagecaag 

gaagaattca 

ttccaggaac 

atgactcctg 

gctctggaag 

ctggtagact 

cccttccggt 

ggaaaggtta 

gtggacatcc 

tctgttgctg 

gagccacgct 

gaaggagcag 

gcaatcagta 

ttccccctta 



ggttgaaaag 
ttaaatgtaa 
ggaaacacag 
ggcaggtgat 
gcctgcttct 
tgaaggatca 
ggcagatgtt 
gctctgggcc 
atgttcagga 
aaacctgtag 
ettcaeggga 
ttggtgcaag 
gtgaccctga 
actcagataa 
tttacatacc 
ggcag 



agggaacaaa 
aaataaggac 
cttcacaatg 
gaatggtttt 
tgagatcccc 
gggtcagtgt 
ctggaaaaca 
tcaaggcaat 
gaaeggagge 
gtacaatccc 
gaaggacctg 
ccatgtcttc 
aggectggat 
caataaatat 
acaagccctt 



cagttcaatg 
gtgaagatga 
gecatgaacg 
caataccaga 
acatctgtgg 
ggctcttgtt 
ggcaaactta 
gagggctgea 
ctggactctg 
aagtattctg 
gcgaaggcag 
ttccagttct 
catgetatge 
tggctggtga 
aatcacattt 



aaggaaagga 
ttgagcagca 
cctttggaga 
agcacaggaa 
actggagaga 
gggcttfctag 
tctcactgaa 
atggtgactt 
aggcatccta 
ctgctaatga 
tggcaactgt 
ataaaaaagg 
tggtggttgg 
agaacaggga 
ctgaaatccc 



atctgataga 
caatcaggaa 
catgaccaat 
ggggaaacag 
gaaaggctac 
tgcaactggt 
tgagcagaat 
catggataat 
tccatatgaa 
cactggtttt 
ggggcccatc 
aatttatttt 
ctacagctat 
atcctttttt 
agaagtcttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

915 



<210> 4 
<211> 570 
<212> DNA 

<213> Homo sapiens 



<400> 4 

tcaaatgtga agecaagaaa gaaaaatcca 
attggtgtca taaatcactg gaaattaccc 
ggaaaagagg gggaaaaacc acacaaaaca 
tcaaatttag gaagcattga aaatggaaca 
cttaaagtca ttttaaaaaa cgcaaacact 
cacaattata actgtttaca ctgcactgtt 
agtatagatg agaaagtaga tcttcatttt 
tatgaattag atgggcggaa gccatttcca 
ttattagagg atgecataga agtttgcaag 
agatttaatg egattgetet ttctgeagea 

<210> 5 

<211> 612 

<212> DNA 

<213> Homo sapiens 



aaagggttca ctgttaatgt cctggcaaac 6 0 
aaacttactc acaaagggga aaaaaatagt 12 0 
aacagtgeat ccaacagctg tgggacaatt 180 
aatgaaggta aatataaaat aaattttatt 24 0 
gttagaactg aaaagagaaa tagacaaatt 3 00 
gaaacatcag attatcaagt tcaagcacca 3 60 
attgeattag ttcatgtaga tgggcatctc 42 0 
attaaccatg gtgaaactag tgatgaaact 4 80 
aagtttatgg agcgcgaccc tgatgaacta 540 

570 



<400> 5 

gcatgtttgg tttcaggttt tgggcccttc 
gcagtgaagg agctctccaa gctgggcctg 
ctggagctgc ctgtagatta cagggaggct 
catcagccgc aactegtegt gcatgtgggc 
gaacagtctg gcaagaacca aggctacegg 
ggcggcgtgt gcctacctgg cagcccagac 
gtctgeaage gcgtagctgt ggagggtgtc 



cggcagcact tggtgaattc cagctgggaa 6 0 
gggaatgaaa cagtggtgca getgeggact 120 
aagcggaggg teaceggaat ctgggaagat 180 
atggacaccg ccgccaaggc gatcattctg 240 
gacgccgaca tccgcagctt ctggcccgag 3 00 
gtgctggagt caggggtctg catgaaggca 3 60 
gaegtgatet tttcccgaga tgcaggcaga 42 0 
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tacgtctgtg attataccta ttacctgtct ctgcatcatg gaaagggctg cgcggcactc 480 

atccatgtcc ctccactatc gcgcgggctc ccggccagcc tgctgggaag agccttgaga 54 0 

gtcatcatcc aggaaatgct ggaagaggca ggggagaaac aaaaagaggt gacggcttct 600 

ggcacatccc ac 612 

<210> 6 

<211> 1074 

<212> DNA 

<213> Homo sapiens 



<400> 6 

agaaaaaagt 

tttatgaatt 

tctgacaagc 

tcgctttttc 

catctgatat 

ttccttattg 

gaggccaata 

caatcagatg 

gacatcagtt 

gctgacagca 

ctacagtggt 

tgccaaagct 

tgttttcatc 

atctcctttc 

atgaaagaag 

gcagtgatta 

caaaaggacc 

ttactctaca 



cagtctatac 
gtattgtcca 
acaaatgtat 
atgctatgta 
ggatccatgc 
caatattaga 
accccaactg 
tcacatgtca 
tggacttgcc 
cagtgagcag 
ttacaaggcc 
accaggagtc 
tcaagcggtt 
ccttggagct 
gccagccacc 
atcaccatgg 
agtggttcag 
gtgaagggta 



tgtaggcctg 
ggcacttacc 
aatgacaagc 
ctctgggagc 
agaacattta 
cgtgctacat 
ctgtaactgc 
agcctgccat 
tggctcttgt 
ggatgaccac 
agagcaccta 
tactaaacag 
tgagcatgta 
ggacatgact 
aacagattgt 
aactttggaa 
ctgtgatgat 
tttactgttc 



agagggctaa 
catattcctc 
cccagcttgt 
cgaactcctc 
gcagggtaca 
agacacagca 
atcatagacc 
agtgtttcta 
gccacattcg 
ataccaggaa 
ggaagcagtg 
ctcacaatga 
ggcaaacaga 
ccgtttttgg 
gtgcccaatg 
agtggccacfc 
gccatcatca 
tatcacaaac 



tcaatcttgg 
tactgaaaga 
gtctggtctg 
acattcccta 
ggcagcagga 
aagatgatag 
aaatctttac 
ccaccataga 
attcccagaa 
tcccctcact 
ccaaaatcaa 
aaaaattacc 
ggcgaaagat 
cctctactaa 
agaataagta 
ataccagctt 
ccaaggctac 
agggtctaga 



gaacacttgt 
tttcttcctc 
tgaaatgtct 
taagttactg 
tgcccatgag 
tggtgggcag 
aggtggcctg 
cccatgctgg 
cccagagagg 
tacagactgt 
atgcaatagt 
cattgtggct 
taataccttt 
agagagcaga 
ttccttgttt 
catccggcaa 
cattgaggac 
gaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1074 



<210> 7 

<211> 1074 

<212> DNA 

<213> Homo' sapiens 



<400> 7 

agaaaaaagt 

tttatgaatt 

tctgacaagc 

tcgctttttc 

catctgatat 

ttccttattg 

gaggccaata 

caatcagatg 

gacatcagtt 

gctgacagca 

ctacagtggt 

tgccaaagct 

tgttttcatc 

atctcctttc 

atgaaagaag 

gcagtgatta 

caaaaggacc 

ttactctaca 



cagtctatac 
gtattgtcca 
acaaatgtat 
atgctatgta 
ggatccatgc 
caatattaga 
accccaactg 
tcacatgtca 
tggacttgcc 
cagtgagcag 
ttacaaggcc 
accaggagtc 
tcaagcggtt 
ccttggagct 
gccagccacc 
atcaccatgg 
agtggttcag 
gtgaagggta 



tgtaggcctg 
ggcacttacc 
aatgacaagc 
ctctgggagc 
agaacattta 
cgtgctacat 
ctgtaactgc 
agcctgccat 
tggctcttgt 
ggatgaccac 
agagcaccta 
tactaaacag 
tgagcatgta 
ggacatgact 
aacagattgt 
aactttggaa 
ctgtgatgat 
tttactgttc 



agagggctaa 
catattcctc 
cccagcttgt 
cgaactcctc 
gcagggtaca 
agacacagca 
atcatagacc 
agtgtttcta 
gccacattcg 
ataccaggaa 
ggaagcagtg 
ctcacaatga 
ggcaaacaga 
ccgtttttgg 
gtgcccaatg 
agtggccact 
gccatcatca 
tatcacaaac 



tcaatcttgg 
tactgaaaga 
gtctggtctg 
acattcccta 
ggcagcagga 
aagatgatag 
aaatctttac 
ccaccataga 
attcccagaa 
tcccctcact 
ccaaaatcaa 
aaaaattacc 
ggcgaaagat 
cctctactaa 
agaataagta 
ataccagctt 
ccaaggctac 
agggtctaga 



gaacacttgt 
tttcttcctc 
tgaaatgtct 
taagttactg 
tgcccatgag 
tggtgggcag 
aggtggcctg 
cccatgctgg 
cccagagagg 
tacagactgt 
atgcaatagt 
cattgtggct 
taataccttt 
agagagcaga 
ttccttgttt 
catccggcaa 
cattgaggac 
gaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1074 



<210> 8 

<211> 1071 

<212> DNA 

<213> Homo sapiens 



<400> 8 

agaatcacct ccagctttac gatcggttta agaggactca tcaatcttgg caacacgtgc 6 0 
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tttatgaact gcattgtcca ggccctcacc 
tctgacaggc accgatgtga gatgccgagt 
tcgctgtttc gggagttgta ttctggaaac 
cacctggtgt ggatacatgc ccgccattta 
ttcctcattg cagcgttaga tgtcctgcac 
gcggccaaca atcccaacca ctgtaactgc 
cagtctgatg tcacctgtca agcctgccat 
gacattagtt tggacttgcc tggctcttgc 
gagagcagtg tgaacgggga aagccacata 
cggaggttta cgaggccaga gcacttagga 
caaagctacc aggaatctac caaacagctc 
tttcatttca aacggtttga acattcagcg 
tcctttcctc tggagctgga tatgacgccg 
aatggacaat tgcagctgcc aaccaatagt 
gctgtggtta atcaccaagg aaccttggag 
cacaaggacc agtggttcaa gtgtgatgat 
gtactggaca gtgaagggta tttactgttc 

<210> 9 

<211> 924 

<212> DNA 

<213> Homo sapiens 



cacacgccga tactgagaga tttctttctc 12 0 
cccgagttgt gtctggtctg tgagatgtcg 18 0 
ccgtctcctc atgtgcccta taagttactg 24 0 
gcagggtaca ggcaacagga tgcccacgag 3 00 
aggcactgca aaggtgatga tgtcgggaag 3 60 
atcatagacc aaatcttcac aggtggcctg 42 0 
ggcgtctcca ccacgataga cccatgctgg 4 80 
acctccttct ggcccatgag cccagggagg 540 
ccaggaatca ccaccctcac ggactgcttg 600 
agcagtgcca aaatcaaatg tggtagttgc 660 
acaatgaata aattacctgt cgttgcctgt 720 
aaacagaggc gcaagatcac tacatacatt 7 80 
tttatggcct caagtaaaga gagcagaatg 84 0 
ggaaacaacg aaaataagta ttccttgttt 900 
agtggccact ataccagctt catccggcac 960 
gccgtcatca ctaaggccag tattaaggac 102 0 
tatcacaaac aggtgctaga a 1071 



<400> 9 

ctgagtagca ggagacctgc tgcggtgggg 
tacgagaacg cttccctgca gtgcctgaca 
tcccgggagc actctcaaac atgtcagcgt 
gctcacatca catgggccct ccacagtcct 
gctgctggct tccatagagg caagcaggaa 
gatgccatga aaaaggcatg ccttcccggc 
accaccctca tccaccaaat atttggaggc 
tgccacggga tttcagacac ttttgaccct 
gctcagagtg tcaagcaagc tttggaacag 
aatgcctatc attgcggtct ttgtctccag 
cacacttctg ccaaggtcct catccttgtc 
aaacttgcca agaatgtgca atatcctgag 
cagaacacag gacctcttgt ctatgtcctc 
tgtcacgatg gacattactt ctcttatgtc 
gatgatgcca aggtcactgc ctgtagcatc 
ctcttttaca tccagaagag tgaa 

<210> 10 

<211> 1011 

<212> DNA 

<213> Homo sapiens 



gctgggctcc agaatatggg aaatacctgc 60 
tacacaccgc cccttgccaa ctacatgctg 12 0 
cccaagtgct gcatgctctg tactatgcaa 180 
ggtcatgfcca tccagccctc acaggcattg 240 
gatgcccatg aatttctcat gttcactgtg 3 00 
cacaagcagg tagatcatca ctctaaggac 3 60 
tgctggagat ctcaaatcaa gtgtctccac 42 0 
tacctggaca tcgccctgga tatccaggca 4 80 
ttggtgaagc ccgaagaact caatggagag 54 0 
agggcgccag cctccaagac gttaacttta 6 00 
ttgaagagat tctccgatgt cacaggcaac 6 60 
tgccttgaca tgcagccata catgtctcag 72 0 
tatgctgtgc tggtccacgc tgggtggagt 7 80 
aaagctcaag aaggccagtg gtataaaatg 840 
acttctgtcc tgagtcaaca ggcctatgtc 900 

924 



<400> 10 

tgtgagaaga gagaaaactt gttaccattt 
tatcttaata gtatacttca ggtattatat 
aaaactagaa tagatgaaat ggaaattttt 
ggatatctac agcatgatgc acaggaagta 
acatgccaac tcctaaaaaa agaagaagta 
gaagaaatac ctcatccgaa agaggaaatg 
gtggagaaat tatttcaagg tcagctggta 
ttaacagaaa gaagagaaga ttttcaagac 
tccaaagtag aggagagttc tgaaaaaatg 
gcttcagtag aaaggattgt aggagaagat 
actgaagctg aacgaagtct tttgtttgac 
aagtgctttg ctgctagtgg aaaccttttc 
ggtggtggac tttccaagat caacactcct 
gaatggagca caaagccaac taacgacagc 
ggcattacaa ttagtagtgg gcattacact 



gtgggactga ataatctcgg caatacttgc 6 0 
ttttgtcccg gctttatgta ctgtatcttc 120 
atttataggg aactcaaccc tatgtatgaa 180 
ttacaatgta ttttgggaaa cattcaagaa 24 0 
aaaaatgtgg cagaattacc tactaaggta 3 00 
aatggtgaag aacaaattgg ttttgagcta 3 60 
ttaaggacgc gttgcttgga atgtgaaagt 4 20 
atcagtgtgc cagtacaaga agatgagctt 4 80 
aagaccctga gatgggcaat ttcacaattt 54 0 
aaatatttct gtgaaaactg ccatcattat- 6 00 
aaaatgcctg aagttataac tattcatttg 6 60 
tttttttcct cccaaaggtt tgattgttat 72 0 
ttattgacac ctcttaaatt gtcactagaa 7 80 
tatggattat ttgcggttgt gatgcatagt 840 
gcttctgtta aagagtatga ggggaagtgg 900 



WO 03/040393 



PCT/IB02/04615 



5/58 



ttgctttttg atgattctga agtcaaagtt actgaagaga aggactttct gaattctctt 960 
tccccttcta catctcctac ttctactcct tacttgctat tttataagaa a 1011 

<210> 11 
<211> 924 
<212> DNA 

<213> Homo sapiens 
<400> 11 

ctgagtagca ggagacctgc tgcggtgggg gctgggctcc agaatatggg aaatacctgc 60 

tacgagaacg cttccctgca gtgcctgaca tacacactgc cccttgccaa ctacatgctg 12 0 

tcccgggagc actctcaaac atgtcagcgt cccaagtgct gcatgctctg tactatgcaa 18 0 

gctcacatca catgggccct ccacagtcct ggccatgtca tccagccctc acaggcattg 24 0 

gctgctggct tccatagagg caagcaggaa gatgtccatg aatttctcat gttcactgtg 3 00 

gatgccatga aaaaggcatg ccttcccggc cacaagcagg tagatcatca ctgcaaggac 3 60 

accaccctca tccaccaaat atttggaggc tgctggagat ctcaaatcaa gtgtctccac 42 0 

tgccacggga tttcagacac ttttgaccct tacctggaca tcgccctgga tatccaggca 4 80 

gctcagagtg tcaagcaagc tttggaacag ttggtgaagc ccgaagaact caatggagag 54 0 

aatgcctatc attgcggtct ttgtctccag agggcgccgg cctccaacac gttaacttta 600 

cacacttctg ccaaggtcct catccttgtc ttgaagagat tctccgatgt cgcaggcaac 660 

aaacttgcca agaatgtgca atatcctgag tgccttgaca tgcagccata catgtctcag 720 

cagaacacag gacctcttgt ctatgtcctc tatgctgtgc tggtccacgc tgggtggagt 7 80 

tgtcacgacg gacattactt ctcctatgtc aaagctcaag aagtccagtg gtataaaatg 84 0 

gatgatgccg aggtcactgt ctgtagcatc atttctgtcc tgagtcaaca ggcctatgtc 900 

ctcttttaca tccagaagag tgaa 924 

<210> 12 
<211* 696 
<212> DNA 

<213> Homo sapiens 
<400> 12 

tatgatagaa aaagacagga caaagctccc gttgggctaa agaatgttgg caatacttgt 60 

tggtttagtg ctgttattca gtcattattt aatcttttgg aatttagaag attagttctg 120 

aattacaagc ctccatcaaa tgctcaagat ttaccccgaa accaaaaggc ttttttcttt 180 

tcacagcaag atgtgagtga gtttacacac aaattattag attggttaga agatgccttc 240 

caaatgaaag ctgaagagga gacggttggc aaggatgtgg agaaattgaa acccttgtgc 3 00 

agtgttggtg aggatatgaa atggtacagc cactgtggaa aacatttttg ttattgtttt 3 60 

atttcctttc agcattggtt tactgaatta ccacctgtgt taacatttga attgtcaaga 420 

tttgaattta atcaggcatt gggaagacca gaaaaaattc acaacaaatt agaatttccc 4 80 

caagttcctt atcgattaca tgccgtttta gttcacgaag gccaagctaa tgctgggcac 540 

tactgggcat atatttttga tcatcgtgaa agcagatgga tgaagtacaa tgatattgct 600 

gtgacaaaat catcatggga agagctagtg agggactctt ttggtggtta tagaaatgcc 660 

agtgcatact gtttaatgta cataaatgat aaggca 6 96 

<210> 13 
<211> 684 
<212> DNA 

<213> Homo sapiens 
<400> 13 

aatgactgga ggagagttga tggttggcca gttgggctga aaaatgttgg caatacatgt 60 
tggtttagtg ctgttattca gtctctcttt caattgcctg aatttcgaag acttgttctc 12 0 
agttatagtc tgccacaaaa tgtacttgaa aattgtcgaa gtcatacaga acagcagcaa 180 
gatgtgagtg aattcacaca caagctcctg gattggctag aggacgcatt ccagctagct 240 
gttaatacct tcggccagta tcctcttcag gtaaacggtt atcgcaactt agacgagtgt 3 00 
ttggaagggg ccatggtgga gggtgatgtt gagcttcttc cctccgatca ctcggtgaag 3 60 
tatggacaag agcgttggtt tacaaagcta cctccagtgt tgacctttga actctcaaga 420 
tttgagttta atcactcctg gggcagagac aagaaggatt caaaggcctt gcacacagtg 480 
ccttatcgct tgcatgcagt tcttgttcat gaaggacaag caaatgctgg acactattgg 54 0 
gcctatatct ataatcaacc ccgacagagc tggctcaagt acaatgacat ctctgttact 600 
gaatcttcct gggaagaagt tgaaagagat tcctatggag gcctgagaaa tgttagtgct 660 
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tactgtctga tgtacattaa tgac 684 

<210> 14 

<211> 768 

<212> DNA 

<213> Homo sapiens 



<400> 14 

tctggaagct ctccttccag ttcatggcct 
caatgtttgt ttatgttgca tttgctttcg 
aggatgagga ggcagcatgg tgctctggaa 
gaggagcgag cgcaggatgc cgacagtgtg 
agctgtacct tggatgaatg ttttcagttc 
gacgcctgga agtgtcctca ctgccaagtc 
tggacgctgc ctgacatcct catcatccac 
agaaacaagc tctccacgct ggtgaagttt 
gtggcccaga gaagcaccag ccctgaggca 
ccggactgcc tgcccaccag ttacccgctg 
aaccaccatg gcaacctgca aggtgggcah 
ggccagtggt acagttatga tgacagcacg 
accagagggg cttatatcct gttctatcag 

<210> 15 
<211> 921 
<212> DNA 

<213> Homo sapiens 



tctggcctcc ggtcgagctg tcccattttt 60 

aggtctcagt cattcctctg gccaagagtg 12 0 

tttcatagag tcctgttcgg gagcctccag 18 0 

tggcagcagc agcaggcgca tcagcagcac 24 0 

tacaccaagg aggagcagct ggcccaggat 300 

ctgcagcagg ggatggtgaa gctgagtttg 360 

ctcaaaaggt tctgccaggt gggcgagaga 42 0 

ccgctctctg gactcaacat ggctccccat 48 0 

ggactgggcc cctggccttc ctggaagcag 54 0 

gacttcctgt acgacctgta tgccgtctgc 600 

tacacagcct actgccggaa ctctctggat 66 0 

gtggaaccgc ttcgagaaga tgaggtcaac 72 0 

aagcggaaca gcatccct 76 8 



<400> 15 

cttccccctg cttttttttt agggcttgtg 
ttcatgaact ccctgctaca aggcctgtct 
tttatctttg aatctctact tccactctac 
catttatata ggcaacaaga tgctcacgaa 
gatgagcgag accgccagcc tcggtctcaa 
atggtctgca aacactgtga acaccagagt 
tcactaagta ttccagccgc cacatggggt 
cacttcatct catcagaatc agtgcgggat 
acttttgtta aacagttaaa actagggaag 
tgttttgagg actccgtgta tcctgcccct 
cacccgcagc tccctcagtg tctctgcatc 
ggcacgcctc tgaagcggca tgagcacgtg 
gcagttgtcg tccaccatgg agacatgcac 
ccaccttctg ccaggaaccc tctctcaact 
actgtccgca aggccagcct gcaggaggtc 
gagcgcgtcc tttccaggat g 



cctggccttg ttaatttagg gaacacctgc 60 
gcctgtcctg ctttcatcag ccttgcctta 120 
agctgcagtt ttatagccca ggagggaatt 180 
ttattccatg tcattacctc gtcattggaa 24 0 
catccttttc atggaagact cactagtaat 3 00 
cctgttcgat ttgatacctt tgatagcctt 3 60 
cacccattga ccctggacca ctgccttcac 420 
gttgtgtgtg acaactgtac aaagaggacc 480 
gtgagcccac actacacacc ctgttggctt 540 
gaaacaactc ggttctcccg atttctcttc 600 
cacctacagc ggctgagctg gtccagccac 660 
cactcctcca catacctctt ccggctgatg 720 
tctggacact ttgtcactta ccgacggtcc 780 
agcaatcagt ggctgtgggt ctccgatgac 84 0 
ctgtcctcca gcgcctacct gctgttctac 900 

921 



<210> 16 

<211> 948 

<212> DNA 

<213> Homo sapiens 



<400> 16 

atgctctcct cccccgattt ctacccagct 
ggctccttgg ccagggcctt agaactgatg 
tacgacaaca taatagatca gaatgaaagc 
gagagagaca gagagagaaa agagagaaga 
ggtgtcgtgg aggtcccctt cctgctctcc 
atcctggagg ctcttgcgga gtttgaacgt 
gaccagagag acttcatttc catcatcccc 
agtggaggga tgcaggtggt ctccctggcg 
gtccttcatg agctcatgca tgtgctgggc 
gaccgctata tccgtgtcaa ctggaacgag 
aagtctcgga gcagcaacat gctgacgccc 



tacccctctg caatgcaatc tctaagcctt 6 0 
acccagtact ttaacaattg gaactgggtc 120 
aaattgagta agtcaaggag agaagagata 180 
gagggagaca gggaaaaaaa gagacagaat 240 
agcaagtacg atgagcccag ccgccaggtc 3 00 
tccacgtgca tcaggtfctgt cacctatcag 3 60 
atgtatgggt gcttctcgag tgtggggcgc 4 20 
cccacgtgtc tccagaaggg ccggggcatt 4 80 
ttctggcacg agcacacgcg ggccgaccgg 54 0 
atcctgccag gctttgaaat caacttcatc 6 00 
tatgactact cctctgtgat gcactatggg 6 60 
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aggcttgcct tcagccggcg tgggctgccc 
cacatcggcc agcgatggaa cctgagtgcc 
ggctgcagcc caaaattaga aaaggtaaat 
acccagttag aaaacacaat tgttagtaaa 
ccaagaaata aatctgacag ctgtatcaaa 

<210> 17 

<211> 2175 

<212> DNA 

<213> Homo sapiens 

<40O> 17 

ttccagctct ggatatggtt aaggccttgc 
gggggtgtat ttcctggagg cagtctctcc 
tttaaagtgc tcttcctgga ccaccatttt 
atcagtccac attttaaaga gatcactcag 
gaagagtttg gaatatcaca atgctataca 
atccattgga cagtatcaga tttgtctcag 
tatatccaaa tcttgtctca aagacattcc 
tacattgaaa ttatggttac agctgatgct 
caaaactata tactgactct aatgtcaatt 
ggaaatttga tacacatagt agtggtaaaa 
ccagtcatta attttgatgg tgctaccaca 
cagaatgacc ttgatgatgt tcacccttcc 
gaagacattt gttcatctaa agagaaatgt 
atatgtgatc ctttacaaag ctgctttatt 
actatagccc atgagcttgg gcacacactt 
aaagaaatga aagttacaaa gtatcatgta 
ccttggagct ggtcaaactg tagtcggaaa 
ggggaatgtc ttcttgacaa accagatgaa 
ggatcacgat atgatggaaa caagcagtgt 
tgtccccata tagagaatat atgcatgcat 
aaaggctgtt tcactcaaca cgtgccacca 
cattgccgtc atgggctatg tgtaaacaaa 
tggggaccat gggaacctta cagttcttgt 
gcaaccaggc gctgtaatcg tcctgagcca 
aggatgaaat ttcgatcatg taatactgat 
gagaagcagt gctctgattt taatggtaaa 
gtgaggtggc ttccaagata cagtggcatt 
caggttgctg gaaccaatta tttctaccta 
tgtggaactg aaactcafcga catctgtgtt 
cacgtgttaa actccagtgc caagatagac 
tcatgcaaga caataacagg tgtcttcaac 
aagattcccg caggagcaac aaacgttgac 
gatgacagtt accttgcatt atctgacgct 
cttctaagta cgtcaaaaaa agaaatcaat 
agtggatcaa ataacgcagt tgaaagaatt 
attttgcagg tgttgtgtgt gggtaattta 
atccctttgg aagag 

<210> 18 

<211> 2100 

<212> DNA 

<213> Homo sapiens 

<400> 18 

cccagcctcc tgtcctgcct tcttagcttt 
ctgtcttgta aaggaagttg gattggaaca 
actgattttg tgcttcggtg gcagaatcga 
ccctatggag gccacagtag ccccatgact 
gatttttaca gcaaccttgt ggtgacagcc 
tgtcttgacc ttcccaacct gctgggcctg 



accatcacac cactttgggc ccccagtgtc 72 0 
tcggacatca cccgggtcct ccaactctac 7 80 
acagttaata taaaaatcat atttctgtac 840 
cataccatta taatagcaat cataaaggtc 900 
tatttgagga aaaatgaa 94 8 



cctgtgacct ggattcccag gtttcctggt 60 
cccttgcaca ttttggggac caaagcattt 12 0 
agactatata tggagcatag taatgacata 180 
attatcacat cfctttcaaga aataatagag 240 
tataataatc catctaaatc tgacattagg 300 
gtttttattc taagtagagc caaaagcctt 360 
aggaaaaaac gtcttatatc atatccaaga 42 0 
aaagtggttt ctgctcatgg atcgaatttg 48 0 
gttgcaacaa tctacaaaga tccaagtatt 540 
ttagttatga ttcaccgtga ggaggaagga 60 0 
ttaaagaact tttgttcatg gcaacaaact 660 
caccatgaca ctgctgttct tatcactagg 72 0 
aacatgttag gtttatcata tttaggtacc 780 
aatgaagaaa aaggactcat ttctgctttt 840 
ggtgttcaac atgatgataa tcctagatgt 900 
atggcccctg ctttaagttt tcacatgagt 960 
tatgttactg aattcctaga tactggttac 102 0 
gaaatatata atctgccttc agaacttcct 1080 
gagcttgcgt ttggtcctgg gtcacaaatg 114 0 
ctgtggtgca caagcacaga aaagcttcac 1200 
gcagatggaa cagactgcgg tcctggaatg 126 0 
gaaacggaaa cacgtcctgt aaatggtgaa 132 0 
tcaagaacat gtggaggcgg aatcgaaagt 13 8 0 
agaaacggag gaaattactg tgtgggccgc 144 0 
tcatgtccaa aaggcacaca agactttcga 150 0 
catttggaca tcagtggcat tccctctaat 1560 
ggcacaaagg atcgttgtaa actctattgt 162 0 
ttgaaggata tggttgaaga tggtactcct 1680 
caaggccagt gtatggcagc tggttgtgat 174 0 
aaatgtggag tgtgtggtgg ggacaactct 180 0 
agttctcatt atggttataa tgttgttgta 1860 
attcgtcagt acagctattc tggacaacca 192 0 
gaagggaatt ttcttttcaa tggaaatttt 1980 
gtgcaaggaa caagaactgt tattgaatac 2 040 
aatagtacta atcgacaaga gaaagaactt 210 0 
tacaaccctg atgtacatta ttccttcaat 2160 

2175 



ccccgcccag ggccagatat agcctggcag 60 
cagacacact cactcacagt ctcatctgcg 120 
atggttgagt atcccggagt tcctcagatg 18 0 
tttctccttt atggagatat tgccaacttt 240 
cctcctgtgg gttggacctc actctcttcc 3 00 
gtgggggacc agctgggcga cacagagcgg 3 60 
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aagcggcggc atgccaagcc aggcagctac agcatcgagg tgctgctggt ggtggacgac 42 0 

tcggtggttc gcttccatgg caaggagcat gtgcagaact atgtcctcac cctcatgaat 480 

atcgtgaatg agatttacca cgatgagtcc ctgggggttc atataaatat tgccctcgtc 54 0 

cgcttgatca tggttggcta ccgacagtcc ctgagcctga tcgagcgcgg gaacccctca 60 0 

cgcagcctgg agcaggtgtg tcgctgggca cactcccagc agcgccagga ccccagccac 6 60 

gctgagcacc atgaccacgt tgtgttcctc acccggcagg actttgggcc ctcaggtatg 72 0 

caagggtatg cacccgtcac tggcatgtgt caccccctga ggagctgtgc cctcaaccat 780 

gaggatggct tctcctcagc cttcgtgata gctcatgaga ccggccacgt gctcggcatg 84 0 

gagcatgacg gtcaggggaa tggctgtgca gatgagacca gcctgggcag cgtcatggcg 90 0 

cccctggtgc aggctgcctt ccaccgcttc cattggtccc gctgcagcaa gctggagctc 960 

agccgctacc tcccctccta cgactgcctc ctcgatgacc cctttgatcc tgcctggccc 1020 

cagcccccag agctgcctgg gatcaactac tcaatggatg agcagtgccg ctttgacttt 10 80 

ggcagtggct accagacctg cttggcattc aggacctttg agccctgcaa gcagctgtgg 114 0 

tgcagccatc ctgacaaccc gtacttctgc aagaccaaga aggggccccc gctggatggg 12 00 

actgagtgtg cacccggcaa gtggtgcttc aaaggtcact gcatctggaa gtcgccggag 1260 

cagacatatg gccaggatgg aggctggagc tcctggacca agtttgggtc atgttcgcgg 132 0 

tcatgtgggg gcggggtgcg atcccgcagc cggagctgca acaacccctc cccagcctat 13 8 0 

ggaggccgcc tgtgcttagg gcccatgttc gagtaccagg tctgcaacag cgaggagtgc 1440 

cctgggacct acgaggactt ccgggcccag cagtgtgcca agcgcaactc ctactatgtg 15 00 

caccagaatg ccaagcacag ctgggtgccc tacgagcctg acgatgacgc ccagaagtgt 15 60 

gagctgatct gccagtcggc ggacacgggg gacgtggtgt tcatgaacca ggtggttcac 162 0 

gatgggacac gctgcagcta ccgggaccca tacagcgtct gtgcgcgtgg cgagtgtgtg 16 80 

cctgtcggct gtgacaagga ggtggggtcc atgaaggcgg atgacaagtg tggagtctgc 174 0 

gggggtgaca actcccactg caggactgtg aaggggacgc tgggcaaggc ctccaagcag 18 00 

gcagctctca agctggtgca gatcccagca ggtgccaggc acatccagat tgaggcactg 18 60 

gagaagtccc cccaccgcat tgtggtgaag aaccaggtca ccggcagctt catcctcaac 192 0 

cccaagggca aggaagccac aagccggacc ttcaccgcca tgggcctgga gtgggaggat 1980 

gcggtggagg atgccaagga aagcctcaag accagcgggc ccctgcctga agccattgcc 2 040 

atcctggtga gccccactct tgatacacaa aatattaaag aacccaggca cagaccagac 2100 



<210> 19 

<211> 2247 

<212> DNA 

<213> Homo sapiens 



<400> 19 

ttgttgttgt 

tcagcgactc 

tcaggagttc 

aaattagaca 

cctcagcttc 

ctattggatt 

tctctctcaa 

atcgtagatg 

agctctgggc 

ctcgtggtgg 

attctcacag 

ataaacgtgg 

aaccatcatg 

aagaatggca 

aagaatgaac 

cgaagttgta 

tcagggcaca 

ggcaatatca 

agccgccagt 

cccaagcaag 

gacacacagt 

atttgcaaat 

gcagcagaag 

tttggggagc 

gaatgttccc 



tgttgttgtt 
aggaggctga 
gagaccagcc 
ggcatggttt 
tggcccagga 
gcttttatca 
ctgtcttatc 
ctcccaagcc 
gacccagaag 
cagacaagaa 
taatgaacat 
ttgtggtgag 
cagaccagtc 
agagacatga 
catgtgacac 
ccatcaatga 
actttggtat 
tgtctcccac 
atctcaagaa 
caggacagta 
gtaaatggca 
cactttggtg 
ggaccgtttg 
tcgggccccg 
ggacatgtgg 



gttgttgttt 
ggcagtagaa 
tggtcaacat 
aataaggaca 
acacaactac 
ttgtcacatc 
cagatattct 
tcccacagag 
atcagctgga 
aatggtggaa 
ggtttctggc 
cctaattctt 
tctgaatagt 
tcatgccatc 
tcfcagggfctt 
ggacacagga 
gattcacgat 
actgaccgga 
attcctcagc 
taaatatccg 
atfctggagca 
ccaccgagta 
tggcttgagt 
gcccatccac 
tggaggagtc 



ttgagacaga 
tcacttgaac 
ggtgaaaccc 
cgaaaaaatg 
agctcccctg 
aaagatttct 
ctaatatccc 
gacacctatc 
aaatcacaaa 
aagcatggca 
ctatttaaag 
ctggaacaag 
ttttgtcaat 
ttactaacag 
gcccccatca 
cttggccttg 
ggagaaggga 
aacaatggag 
acacctcagg 
gacaaactac 
aaagccaagt 
ggccacaggt 
atgtggtgtc 
ggccagtggt 
aagttccagg 



gttggcaggg 
ccaggaggcg 
tgtctctact 
aattcctcat 
cgggtcacca 
cttcttccct 
ttccaaatgc 
taaggtttga 
agggcctcaa 
agggaaatgt 
atgggactat 
aacctggagg 
ggcagtctgc 
gatttgatat 
gtggaatgtg 
ccttcaccat 
atccctgcag 
tgttttcatg 
cggggtgtct 
caggacagat 
tatgcagcct 
gtgagaccaa 
ggcaaggcca 
ccgcctggtc 
agagacactg 



acctataatc 
gaggttgcat 
gaaaatccaa 
ctcgccatta 
tcctcacgta 
tgtttcagtt 
tcttctgttc 
tgaatatggg 
tgtggaaacc 
caccacatac 
tggaagtgac 
attattgatc 
cctcattgga 
ttgttcttgg 
ctctaagtac 
cgctcatgag 
aaaggctgaa 
gtcttcctgc 
agtggafcgag 
ttatgatgct 
tggttttgat 
gtttatgccc 
gtgcgtaaag 
gaagtggtca 
caataacccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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aagcctcagt 
attaaccctt 
agcaaacctt 
gatcgatgca 
aaagtgaaag 
tgtgaactag 
gtttgcaaag 
tgcctcaccc 
gaaatccagg 
tattacctca 
acgtttgaat 
aatgagacgc 
ctgcaagctc 



atggtggctt 
gcaatgaaaa 
tccgtggatg 
aactgtactg 
atggaactcc 
tgggatgtga 
gtgataattc 
tcttgaaata 
agctgcaggt 
ccgggggctg 
accagcgctc 
tgtgttgctc 
cgcctcctga 



attctgtcca 
tagcttggat 
gttctaccag 
caaggctgag 
ctgctcccca 
tcatgaacta 
aacttgcaag 
ttatccggtg 
ttcctccagt 
gagcatcgac 
tttcaaccgc 
tgtcgcccag 
gttcacg 



ggttctagcc 
tttcgggctc 
tggaaaccct 
aactttgaat 
aacaaaaatg 
ggctctaaag 
ttttataaag 
gtcctcattc 
tacctcgcag 
tggcctgggg 
ccggaacgtc 
gctggagggc 



gtatttatca 
aacagtgtgc 
atacaaaagt 
ttttttttgc 
atgtttgtat 
cagtttcaga 
gcctgttcaa 
cagctggcgc 
ttcgaagcct 
agttcccctt 
tgtacgcgcc 
agttgcgtga 



gctgtgcaat 
agaatataac 
ggaagaggaa 
aatgtccggc 
tgacggggtt 
tgcttgtggc 
gcaattctcc 
ccgaagcatc 
cagtcaaaag 
cgctgggacc 
agggcccaca 
tctcggctca 



<210> 20 

<211> 2166 

<212> DNA 

<213> Homo sapiens 



<400> 20 

ctcctcctct 

ctcagcgccc 

ctcggtcctc 

cgctctccca 

cacggagctt 

gaggttgacc 

agagcagtgg 

gacttccaca 

acgttgggaa 

tatcaaggct 

ttgtcaggca 

ctctcatgga 

gaactgaacg 

gaaaatatca 

ggaacaatag 

cagccaggac 

cagtctggat 

ctggatatat 

ggaatgtgta 

ttcaccattg 

atgtgcaaaa 

ttctcctggt 

atctgccttg 

ggagaattat 

tgcatgctgg 

actaaattta 

ggacagtgtg 

tggtcttctt 

ctctgcacca 

ctgaagctct 

tgtgccgagc 

caagtagaag 

ttttctttgt 

tgtatagatg 

gaagacgtct 

tacaccaagc 

aagaag 



ggcgctgccc 
cgctgacctc 
tgaccccccc 
ccgctcccgc 
cacctctcct 
acaggggcga 
ccgtgtccga 
tggatctgag 
agacaggcac 
ctttgcgatc 
tgatacgaac 
aactcggcag 
tggagacctt 
ccacctacgt 
gaggaaacat 
tggtgataag 
tgatggggaa 
gttcctggaa 
gtaaatatcg 
cccatgagtc 
agtccgaggg 
caccctgcag 
ctgatcagcc 
atgatgcaaa 
actttaaaaa 
tgccagcagc 
tgaaatatgg 
ggtccccatg 
accccaagcc 
gcaacagtca 
acaacagcag 
atcaggactt 
caaataaagt 
ggatatgtga 
gtggggtgtg 
accaccacac 



gctgtctccc 
gcctcctccc 
gccctcacct 
cgcccccgcc 
cactcttttg 
ttacgtgtcc 
ggttgagtct 
gacttccagc 
taagtctgtg 
acacagaaac 
agaagaggca 
agctgcccaa 
ggtggtggtc 
gctcacgata 
caacattgca 
tcaccacgca 
agatgggact 
gaatgagccc 
cagctgcacg 
tggacacaac 
caacatcatg 
ccgccagtat 
aaagcctgtg 
cacacagtgc 
ggcaaccctg 
agaaggcaca 
tgatgaaggc 
ctccaggacc 
atcgcatgga 
gaaatgtccc 
acgattcaga 
atgcaaactc 
caaagatggg 
gagagttgga 
taacgggaat 
caaccgtgag 



gccttccctc 
ctctgctctt 
cctcccctcc 
gccgcggctg 
atttcagaat 
catgaaatca 
cttcaccttc 
agcctagtgg 
cagactttac 
tcctcagtgg 
gattacttcc 
ggcagctcgc 
gacaaaaaga 
ctcaacatgg 
attgtaggtc 
gaccacacct 
cgtcatgacc 
tgtgacactt 
attaatgaag 
tttggcatga 
tcccctacat 
ctacacaaat 
aaggaataca 
aagtggcagt 
tggtgccatc 
atttgtgggc 
cccaagccca 
tgcggagggg 
gggaagttct 
cgggacagtg 
gggcggcact 
tactgtatcg 
actccatgct 
tgtgacaatg 
aactcagcct 
tactttagag 



tgctcccctc 
tgtccctgca 
tctctcccct 
ccactccgcc 
atgacctggt 
tgcaccatca 
ggctgaaagg 
ctcctggctt 
cgccagagga 
ccctttcaac 
taaggccact 
catcccacgt 
tgatgcaaaa 
tatctgcttt 
tgattcttct 
taagtagctt 
acgccatctt 
tgggatttgc 
atacaggtct 
ttcatgatgg 
tggcaggacg 
ttctaagcac 
agtatcctga 
tcggagagaa 
gtattggaag 
atgacatgtg 
cccatggcca 
gagtatctca 
gtgagggctc 
ttgacttccg 
acaagtggaa 
cagaaggatt 
cggaggatag 
tccttggatc 
gcacgattca 
ctgcctgcaa 



tcgcctccgc 
ctctcccctc 
gcccgccccg 
ccccgcgccg 
ctctgcctac 
gcggcggaga 
ccccaggcac 
tattgtgcag 
cttctgtttc 
ctgccaaggc 
tccttcacac 
actaaatgaa 
ccatggccat 
attcaaagat 
agaagatgaa 
ctgccagtgg 
actgactggt 
acccataagt 
tggactggcc 
agaagggaac 
caatggagtc 
cgctcaagct 
gaaattgcca 
agccaagctc 
gaaatgtgag 
gtgccgggga 
ctggtcggac 
taggagtcgc 
cactcgcact 
tgctgctcag 
gccttacact 
tgatttcttc 
ccgtaatgtt 
tgatgcfcgtt 
baggggtctc 
gccttgggca 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2247 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2166 



<210> 21 
<211> 2229 
<212> DNA 

<213> Homo sapiens 
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<400> 21 

atgtcctctt accgtcccag ggtcatatcc 
gctgtgacag tttctcagac tttcattgtt 
ttgacagtct tgaggattac tgatcaggtg 
atagcagcag ctgccaggat cctccagaac 
ttctaccagt ttgcatgcgg aggctggctg 
agatacagca tctttgacgt cctccgcgac 
gagaattcga ctgccaagga ccggccggct 
tgcatgaacc agagtgtgat agagaagcga 
gtggtgggag gctggccggt ggcgatggac 
gagctggagc ggcagctggc gctgatgaac 
ctcttcatct ggaacgacga ccagaactcc 
accttgggca tgccctcccg agagtactac 
gaagcctacc tgcagttcat ggtgtcagtg 
cccagggaca gctgcctggt gcaggaggac 
ctggccaagg ccacggtacc ccaggaggag 
atgggactgg aggagctgca aagccagttt 
atacaaactg tgctatcctc tgtcaaaatc 
tatggcatcc cctacctgca gaaccttgaa 
atacagaact acctggtctg gcgcctggtg 
ttcaaggaca cacgagtgaa ctaccgcaag 
cgctggcgtg aatgtgtggg ctacgtcaac 
tacgtcaggg aggcgttccc tggagacagc 
gtgcggacag tgtttgtgga gacgctggac 
aagaaggcgc aggagaaggc catgagcatc 
ctggaggaga tgaacaggcg cctggacgag 
ctgtactttg agaacagtct gcagaacctc 
cttcgggaaa aggtggaccc . aaatctctgg 
tactccccaa accgaaacca gattgtattc 
agcaaggagc agccacaggc cttgaacttt 
atcacgcacg gctttgacga caatggccgg 
tggtggagta acttcjtccac ccagcacttc 
tacggcaact actcctggga cctggcagac 
ggggaaaaca ttgctgacaa cggaggggtg 
atggcagagg gtggcaagga ccagcagctg 
ttcttcatca actatgccca ggtgtggtgc 
tccatcaaga cagacgtcca cagtcccctg 
gccttcgcag acacgttcca ctgtgcccgg 
cgcgtgtgg 

<210> 22 

<211> 1113 

<212> DNA 

<213> Homo sapiens 



agaatgccac atcttctcag gctcctcttg 60 
ttcttgtttg tgtttttgtt ttttgtgatc 120 
agcgaggtct gcaccacccc tggctgcgtg 18 0 
atggacccga ccacggaacc gtgtgacgac 24 0 
cggcgccacg tgatccctga gaccaactca 30 0 
gagctggagg tcatcctcaa agcggtgctg 3 60 
gtggagaagg ccaggacgct gtaccgctcc 42 0 
ggctctcagc ccctgctgga catcttggag 480 
aggtggaacg agaccgtagg actcgagtgg 54 0 
tcacagttca acaggcgcgt cctcatcgac 6 00 
agccggcaca tcatctacat agaccagccc 6 60 
ttcaacggcg gcagcaaccg gaaggtgcgg 72 0 
gccacgttgc tgcgggagga tgcaaacctg 7 80 
atggtgcagg tgctggagct ggagacacag 84 0 
agacacgacg tcatcgcctt gtaccaccgg 90 0 
ggcctgaagg gatttaactg gactctgttc 960 
aagctgctgc cagatgagga agtggtggtc 1020 
aacatcatcg acacctactc agccaggacc 10 80 
ctggaccgca ttggfcagcct aagccagaga 114 0 
gcgctgtttg gcacaatggt ggaggaggtg 12 00 
agcaacatgg agaacgccgt gggctccctc 12 60 
aagagcatgg tcagagaact cattgacaag 13 2 0 
gagctgggct ggatggacga ggagtccaag 13 80 
cgggagcaga tcgggcaccc tgactacatc 1440 
gagtactcca atctgaactt ctcagaggac 1500 
aaggtgggcg cccagcggag cctcaggaag 15 60 
atcatcgggg cggcggtggt caatgcgttc 1620 
cctgccggga tcctccagcc ccccttcttc 1680 
ggaggcattg ggatggtgafc cgggcacgag 174 0 
aacttcgaca agaatggcaa catgatggat 18 00 
cgggagcagt cagagtgcat gatctaccag 1860 
gaacagaacg tgaacggatt caacaccctt 192 0 
cggcaagcct ataaggccta cctcaagtgg 1980 
cccggcctgg atctcaccca tgagcagctc 2 04 0 
gggtcctacc ggcccgagtt cgccatccaa 2100 
aaggtactgg ggtcgctgca gaacctggcc 2160 
ggcaccccca tgcaccccaa ggagcgatgc 2220 

2229 



<400> 22 

atgactgcac ttgaccgtgc ttgtctttat 
gatataaaaa ataatggtca cttttatgta 
gactttattg tccatatcac tatcagtatt 
aagttccaaa cttttccaca ttttcctgtc 
tggatgagag agatcagtga gaagtacaag 
acctatgaga cccaccccat gtattatctg 
aaaatcattt ggatggactg tggaattcac 
caatggttcg tcaaagaaat tctacaaaac 
cttaggaacc tggacttcta tgtccttcca 
tggacaactg atcgtcttfcg gaggaaatcc 
gggacggatc tcaatcgaaa tttcaatgca 
tgccaagatc aaacattctg tgggacaggg 
gccagcttca tagagagcaa gaaggatgat 
gggcagttaa ttctcacacc ttacggctac 
atgattcaag ttggacagaa ggcagcaaat 
agagttggat cgagtgcaga tattttatat 



tggctgtttt tatttaaatt attagttatt 60 
actctcgcca actcaaaaca tcttagcctg 12 0 
ttggtcaaag ccattcaacg agtctctagg 18 0 
ttctactggg cccttcaaac tatctatgag 24 0 
gaagtggtga cacagcattt cctaggagtg 30 0 
aagatcagcc aaccatctgg taatcccaag 3 60 
gccagagaat ggattgctcc tgctttttgc 42 0 
cataaagaca actcaagtat acgcaagctc 48 0 
gttcttaaca tagatggtta tatctacact 54 0 
cgttcacccc ataataatgg cacatgtttt 600 
tcttggtgta gtattggtgc ctctagaaac 660 
ccagtgtctg aaccagagac taaagctgtt 72 0 
attttgtgct tcctgaccat gcactcttat 78 0 
accaaaaata aatcaagtaa ccacccagaa 84 0 
gcattgaaag caaagtatgg aaccaattat 900 
gcctcatcag ggtcttcaag agattgggcc 960 
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cgagacattg ggattccctt ctcatatacg tttgagctga gggacagtgg aacatatggg 1020 
tttgttctgc cagaagctca gatccagccc acctgtgagg agaccatgga ggctgtgctg 10 80 
tcagtcctgg atgatctaca aaaaaaccca tat 1113 

<210> 23 
<211> 954 
<212> DNA 

<213> Homo sapiens 



<400> 23 

cctaacacac 

gggttttgcc 

gctgcagttt 

acaggcattg 

gggcttcatc 

aaggttgaca 

aatgcctatc 

aacaacgtgc 

aacattgatg 

gtgggcaatg 

gaagcaattg 

cacataactc 

tcttactttc 

gaggagggca 

gtcctggagg 

cacacggttc 



aaaaccatat 
aagattgctt 
cttcagctca 
taccagactg 
aggcttgtca 
tgacaactga 
cctcacctct 
tctgtcatgg 
tcacagtcta 
tggacgaatg 
cagcttgcag 
atcagaatgg 
atggacatcc 
tggcattcac 
atgcatggac 
tgatcacgtc 



gcctctgtgt 
tagaaggaat 
tccggttcct 
gggagacagc 
gctggcccgc 
agagatagat 
a ggctatgga 
tattcctgac 
ttacaatggc 
tggtaaaaag 
agcaggggct 
ttttcaagtc 
agaaatttgg 
tatagagcca 
tgtggtctcc 
gaggggcgcg 



ctagagttag 
gaagatattt 
aagcacataa 
atagaagtta 
cacgtcctcc 
gctcttgttc 
ggttttccaa 
agtcgacctc 
taccatggag 
ttagtggagg 
cccttctctg 
tgtccacatt 
catcatgcaa 
atcatcacgg 
ctagacaatc 
cagatcctga 



gcatcaggag 
cacacagtat 
agaagccaga 
agaatgaaga 
tcttggctgg 
atcgggaaat 
aatctgtttg 
ttcaggatgg 
acacctctga 
ttgccaggag 
taattggaaa 
ttgtgggaca 
acgacagtga 
agggatcccc 
aaaggtcggc 
ccaaactacc 



ttatcactct 
agttttgccg 
ctatgtgacg 
tcagattcaa 
gaagagttta 
catcagtcat 
tacctctgta 
agatattatc 
aacatttttg 
gtgtagagat 
cacaatcagc 
tggaatagga 
tctacccatg 
tgaatttaaa 
gcagttcgag 
ccat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

954 



<210> 24 
<211> 711 
<212> DNA 

<213> Homo sapiens 



<400> 24 

aggatcgtgg 

gagaacaagg 

gctcactgct 

tacctcagcg 

cccctgtaca 

ctgcctttcg 

cccagcaaga 

ccagaggtgc 

tacggccatt 

tcctgccagg 

ctggctggca 

gcccgagtca 



gcggcatgga 
agcacttctg 
tcaatgagtt 
gctcggaggc 
acgcggacac 
gccggcacat 
agtgcctgat 
tgcagaaagc 
cactcactga 
gtgactcagg 
tcgtgagctg 
ccaggctacg 



agcatccccg 
tggggccgcc 
ccaagacccg 
cagcaccgtg 
ggccgacttt 
ccagcccgtg 
ctcaggctgg 
cactgtggag 
caggatggtg 
aggacccctg 
gggaatcggg 
tgactggatc 



ggggagtttc 
atcatcaacg 
acgaagtggg 
cgggcccagg 
gacgtggctg 
tgcctcccgg 
ggctacctca 
ctgctggacc 
tgcgctggct 
gtctgcgagg 
tgtgcggaag 
ctggaggcca 



cgtggcaagc 
ccaggtggct 
tggcctacgt 
tggtccagat 
tgctggagct 
ctgccacaca 
aggaggactt 
aggcactgtg 
acctggacgg 
agccctctgg 
cccggcgtcc 
ccaccaaagc 



cagccttcga 
ggtgtctgct 
gggtgcgacc 
cgtcaagcac 
gaccagccct 
catcttccca 
cctggtcaag 
tgccagcttg 
gaaggtggac 
ccggttcttt 
aggggtctat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

711 



<210> 25 

<211> 714 

<212> DNA 

<213> Homo sapiens 



<400> 25 

cgcattgttg 

gttcggggtc 

gcccactgct 

aaggtgtggc 

ctgcacccgt 

cacccggtgg 

ttcgagcccg 

atcagcaacg 

gtctatcgct 

gatgcctgtc 



gtggagctgt 

gacacafcctg 
tccaggagga 
agaactcgcg 
accacgaaga 
tgcgctcggc 
gcctgcactg 
ctctgcagaa 
accaggtgac 
agggtgactc 



gtcctccgag 
tgggggggcc 
cagcatggcc 
ctggcctgga 
ggacagccat 
cgccgtgcgc 
ctggattacg 
agtggatgtg 
gccacgcatg 
a 99 t ggtccg 



ggtgagtggc 
ctcatcgctg 
tccacggtgc 
gaggtgtcct 
gactacgacg 
cccgtctgcc 
ggctggggcg 
cagttgatcc 
ctgtgtgccg 
ctggtgtgca 



catggcaggc 
accgctgggt 
tgtggaccgt 
tcaaggtgag 
tggcgctgct 
tgcccgcgcg 
ccttgcgcga 
cacaggacct 
gctaccgcaa 
aggcactcag 



cagcctccag 
gataacagct 
gttcctgggc 
ccgcctgctc 
gcagctcgac 
ctcccacttc 
gggcggcccc 
gtgcagcgag 
gggcaagaag 
tggccgctgg 



60 

120 

180 

240 

3O0 

360 

420 

480 

540 

6O0 
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ttcctggcgg ggctggtcag ctggggcctg ggctgtggcc ggcctaacta cttcggcgtc 66 0 

tacacccgca tcacaggtgt gatcagctgg atccagcaaa ccatggccca gage 714 

<210> 26 

<211> 705 

<212> DNA 

<213> Homo sapiens 

<400> 26 

aggattgtgg gcggcagcgc agegggcegt ggggagtggc cgtggcaggt gagcctgtgg 60 

ctgcggcgcc gggaacaccg ttgeggggee gtgctggtgg cagagaggtg getgetgteg 12 0 

gcggcgcact gettcgaegt etaeggggae cccaagcagt gggeggcett cctaggcacg 18 0 

ccgttcctga gcggcgcgga ggggcagctg gagcgcgtgg cgcgcatcta caagcacccg 24 0 

ttctacaatc tctacacgct cgactacgac gtggcgctgc tggagctggc ggggccggtg 3 00 

cgtcgcagcc gcctggtgcg tcccatctgc ctgcccgagc ccgcgccgcg acccccggac 3 60 

ggcacgcgct gcgtcatcac eggctgggge tcggtgcgcg aaggaggctc catggegegg 42 0 

cagctgeaga aggeggcegt gcgcctcctc agegagcaga cctgccgccg cttctaccca 480 

gtgeagatea gcagccgcat gctgtgtgcc ggcttcccgc agggtggcgt ggacagctgc 54 0 

tcgggtgacg ctgggggacc cctggcctgc agggagcect ctggacggtg ggtgctaact 60 0 

ggggtcacta gctggggcta tggctgtggc cggccccact tcccaggtgt ctatacccgg 66 0 
gtggcagctg tgagaggctg gataggacag cacatccagg acaac 705 

<210> 27 

<211> 714 

<212> DNA 

<213> Homo sapiens 



<400> 27 

cgcatcatcg gaggcacaga caccctggag 
tttgttggat ctgcctactg tggtgcctca 
gcccactgtt ttcatggaaa caggctgtca 
atgtatgttc aggggaatgc caagtttgtc 
tactataaca gtcagacttt tgattatgat 
cctgagaccc tgaaacagct cattcagcca 
cgcagtgggg agaagtgctg ggtaactggc 
ggctccctcg ttctgeagea ageggaggta 
acctaeggga tcatcacttc teggatgetc 
gectgeaaag gagatteggg tggaccttta 
attttgactg gcattgttag ctggggacat 
tacacaaggg tgtcaaactt tgttccctgg 

<210> 28 

<211> 705 

<212> DNA 

<213> Homo sapiens 



gggggttggc cgtggcaggt cagcctccac 60 
gtcatctcca gggagtggct tetttctgea 12 0 
gatcccacac catggactgc acacctcggg 180 
tccccggtga gaagaattgt ggtccacgag 24 0 
attgetttge tacagctcag tattgectgg 300 
atatgeatte ctcccactgg tcagagagtt 3 60 
tgggggcgaa gacacgaagc agataataaa 42 0 
gagctcattg ateaaacget ctgtgtttcc 480 
tgtgcaggca taatgtcagg caagagagat 54 0 
tcttgtcgaa gaaaaagtga tggaaaatgg 60 0 
ggaagtggac gaccaaactt tcctggtgtt 660 
attcataaat atgtcccttc tctt 714 



<400> 28 

agaattgtcc aaggaaggga aacagctatg 
cagctcatag ggtcaggeca tcagtgtgga 
acagcagctc actgettttg gaaaaataaa 
gcaactataa caccacccgc agtgaaacga 
taccatagag aaacaaatga aaatgacatt 
ttttcaaata tagtccagag agtttgcctc 
acaagtgtgt tegtcacagg atttggatcc 
etteggcaag ccagagtgga aaccataagc 
gatggcctga taactccagg aatgttatgt 
tgtaagggag attctggtgg acctctggtt 
ggtatagtaa gtfcggggaca atcatgtgca 
gtaactaagt atcgagattg gattgectea 



gaaggggaat ggccatggca ggccagcctc 60 
gccagcctca tcagtaacac atggctgctc 12 0 
gacccaactc aatggattgc tacttttggt 180 
aatgtgagga aaattattct tcatgagaat 24 0 
gctttggttc agctctctac tggagttgag 3 00 
ccagactcat ctataaagtt gccacctaaa 3 60 
atfcgtagatg atggacctat acaaaataca 420 
actgatgtgt gtaacagaaa ggatgtgtat 4 80 
gctggattca tggaaggaaa aatagatgea 54 0 
tatgataatc atgacatctg gtacattgta 600 
cttcccaaaa aacctggagt ctacaccaga 6 60 
aagactggta tgaac 7 05 



<210> 29 
<211> 690 
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<212> DNA 

<213> Homo sapiens 



<400> 29 

cgcatcgtgg ggggcagcgc ggcgccgccc 
ctcggcgggc agcctctgtg cggcggcgtc 
gcgcactgct ttcttctgtg gactgtgacg 
gaggaggtgc cagtgaaccg catcctgccc 
aacgacctgg ccctggtgca gctgtggacg 
gtgtgcctgc cccaggagcc ccaggagccc 
tggggcgc.cc tcttcgaaga cgggcctgag 
ctgctcagca ccgacacctg ccgaagagcc 
ctctgcgccg ggtacctggc ggggggcgtt 
ctgacctgtt ctgagcctgg cccccgccct 
ggggacggct gcggggagcc agggaagccc 
gactggctcc aggagcagat gagcggtgag 

<210> 30 

<211> 699 

<212> DNA 

<213> Homo sapiens 



ggggcctggc cctggctggt gaggctgcag 6 0 
ctggtagcgg cctcctgggt gctcacggca 12 0 
ctggcagagg ggtcccgggg ggagcaagcg 18 0 
caccccaagt ttgacccgcg gaccttccac 24 0 
ccggtgagcc cggggggatc ggcgcgcccc 3 00 
cctgccggaa ccgcctgcgc catcgcgggc 3 60 
gctgaagcag tgagagaggc ccgtgttccc 4 20 
ctggggcccg ggctgcgccc cagcaccatg 4 80 
gactcgtgcc agggtgactc gggaggcccc 54 0 
agagaggtcc tgttcggagt cacctcctgg 600 
ggggtctaca cccgcgtggc agtgttcaag 66 0 

690 



<400> 30 

agaatagcat ctggagtcat tgcacccaag 
tatgataaca tccatcagtg tggggccacc 
gcacactgct tccagaagta taaaaatcca 
atcaaccctc ccttaatgaa aagaaatgfcc 
tctgcagcaa gagagtacga cattgctgtt 
gatgacatac gccagatttg tttgccagaa 
gtccacatca caggatttgg agcactttac 
gaagccagag tgaaaatcat aagtgatgat 
gatataaaac ctggaatgtt ctgtgccgga 
ggtgattctg ggggaccttt agtcacaagg 
attgtaagct ggggagataa ctgtggtcaa 
acttattacc gaaactggat tgcttcaaaa 

<210> 31 
<211> 741 
<212> DNA 

<213> Homo sapiens 



gcggcctggc cttggcaagc ttcccttcag 6 0 
ttgattagta acacatggct tgtcactgca 12 0 
catcaatgga ctgttagttt tggaacaaaa 18 0 
agaagattta ttatccatga gaagtaccgc 24 0 
gtgcaggtct cttccagagt caccttttcg 300 
gcctctgcat ccttccaacc aaatttgact 3 60 
tatggtgggg aatcccaaaa tgatctccga 42 0 
gtctgcaagc aaccacaggt gtatggcaat 4 80 
tatatggaag gaatttatga tgcctgcagg 54 0 
gatctgaaag atacgtggta tctcattgga 6 00 
aaggacaagc ctggagtcta cacacaagtg 660 
acaggcatc 6 99 



<400> 31 

cggattatag ggggcaccga agcacaagct 
attaaatatg gccgtgttct tgttcatgta 
gtcctcacag ctgcccactg cactaaagac 
attggaacta ataatataca tggacgctat 
atcattattc atccaaactt cattttggaa 
ttaaaaaaag cagtgaggta taatgactat 
ttccaaatcc tggacggaaa cacaaagtgt 
gaaggtaatt atggtaacgc tacaaatatt 
cgagagatgt gtaattctga gaggagttat 
gcaggtgatg aagatggagc ttttgatact 
tgctacttac cagaatataa aagatttttt 
tgtggtcgaa gaggttttcc tggtgtctat 
acagagcatt tcttccatgc a 

<210> 32 

<211> 759 

<212> DNA 

<213> Homo sapiens 



ggcgcatggc cgtgggtggt gagcctgcag 6 0 
tgtgggggaa ccctagtgag agagaggtgg 12 0 
gctagcgatc ctttaatgtg gacagctgtg 180 
cctcatacca agaagataaa aattaaagca 24 0 
tcttatgtaa atgatattgc actttttcac 3 00 
attcagccta tttgcctacc ttttgatgtt 3 60 
tttataagtg gctggggaag aacaaaagaa 42 0 
ttacaagatg cagaagtgca ttatatttct 4 80 
gggggaataa ttcctaacac ttcattttgt 54 0 
tgcaggggtg acagtggggg accattaatg 6 00 
gtaatgggaa ttaccagtfca cggacatggc 660 
attgggccat ccttctacca aaagtggctg 72 0 

741 



<400> 32 
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agaattagta 
tcagatgagc 
gcacactgcc 
gagtacagcc 
acccatcctg 
aaacacaaag 
aaagttgaac 
gaatattcaa 
actgtgctca 
cctgattggg 
ggtggtggaa 
tcagttcccg 
gagttgatgg 



gttggagaaa 
accacttctg 
tggacagcct 
tctttcagaa 
aatacaacag 
tcaagtttgg 
caggaattct 
atgtcctaca 
agagcatgaa 
gaatggacgc 
tctggattct 
taagaaacaa 
attttatcac 



ttcaacagtg 
tggaggaagc 
cagtgagaag 
ggataagcaa 
ccgtgaatat 
aaatgctgtt 
ttgcttatcc 
agaaatggaa 
cctccctccc 
ctgccagggg 
tgctgggata 
ccatgtgaag 
tcaaaacctg 



actggacatc 
ttgattcaag 
cagctgaaga 
gaacagaata 
atgagtcctg 
cagccaatct 

agtggatggg 

cttcccatca 
ctgggaagga 
gactctggag 
acttcctggg 
gcatcacttg 
ttcacaggt 



catggcaggt 
aagatcgggt 
atataactgt 
ttcctgtctc 
atattgcact 
gtcttcctga 
gcaagatttc 
tggatgacag 
ccatgctgtg 
gaccactggt 
tagctggttg 
gcattttctc 



ctccctaaaa 
tgttacagca 
gacttctggg 
aaaaattatt 
gctgtatcta 
cagcgatgat 
caaaacatca 
agcgtgtaat 
tgctggcttc 
ttgtagaaga 
tgctggaggt 
caaagtgtct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

759 



<210> 33 

<211> 723 

<212> DNA 

<213> Homo sapiens 



<400> 33 

gagatttggt 

agtgggcaac 

cactgtttta 

actcttccct 

gatgaacatc 

gatgtacatc 

gttgttacag 

cataaagcat 

agtatggtac 

cagggagact 

taccttgttg 

gtctacacac 

aaa 



caggggaaca 
actactgtgg 
aagtgacaaa 
atatgcaaca 
atgatgatat 
gagtttgtct 
gatggggaag 
ctgtgaagat 
aggatagagt 
ctggaggacc 
gagtagtgag 
agacagatgt 



ggggcagaat 
ggcatcattg 
aaatccaaaa 
tgatgttcaa 
cgcacttata 
tcctgaagcc 
actttcattt 
tactgataca 
gctatgtgct 
actagttcat 
ctggggaagg 
cttttttttt 



gatatggttt 
atcagtgaaa 
aactatactg 
caaattatta 
ctgctcacta 
acacagattt 
aatggtaaga 
aacacttgta 
gggtacatgg 
cctaattctc 
aatgaatgtg 
ttaaagtgga 



ggctctcaag 
gacacttggt 
tcagctttgg 
ttcatgaaga 
aaaaagfcttt 
ttccacctgg 
tcagtgaaaa 
atgctaaaga 
aaggaaatat 
taaatatttg 
gtgcaatcaa 
tcaaaagcac 



ccttaaaatg 
gactgcagct 
cacgaaagta 
ctacatccag 
atttaagaat 
tgaaggagtt 
cttaacatac 
agcctatcgt 
agacgcctgc 
gtatatttgg 
tagtccaggg 
aattgctctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

723 



<210> 34 

<211> 693 

<212> DNA 

<213> Homo sapiens 



<400> 34 

aggatagtca 

agctcttcca 

tatactgtgt 

gtgtccaaaa 

aaactgtcct 

acaaagcagt 

agttcagata 

caggcttgtg 

atcaaggaag 

gattctggag 

agctggggat 

caaaaatgga 



gcatggaatc 
ttcagggaag 
ggctaggatc 
tcgtcatcca 
ctcaagtcac 
tggcaattcc 
gagattacca 
aacagctcta 
acaagatttg 
ggcctctgtc 
tagaatgtgg 
ttaatgccac 



taagaaggga 
caggaaagat 
gattacagta 
tcccaagtac 
cttcacttct 
acccttttgt 
ttctgccctt 
caatcccatc 
tgctggtgat 
gtgtcacatt 
taaatctctt 
tatttcaaga 



aaagtccaat 
aaggccataa 
ggtgactcaa 
caagatacaa 
gccatcctgc 

tgggtgaccg 

caggaagcag 
ggtatcttct 
actcaaaaca 
gatggtgtat 
cctggagtct 
gcc 



ggctagtggt 
agacctggac 
ggaaacgtgt 
cggcagacgt 
ctatttgctt 
gatggggaaa 
aagtacccat 
tgccagcacb 
tgaaggatag 
ggatccagac 
acaccaatgt 



cctgtttggc 
tactttttca 
gaagtactac 
cgccttgttg 
gcccagtgtc 
agttaaggaa 
tattgaccgc 
ggagccagtc 
ttgcaagggt 
aggagtagta 
aatctactac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

693 



<210> 35 

<211> 669 

<212> DNA 

<213> Homo sapiens 

<400> 35 

ttggccttta atccagatta cacagtcagc tccactcccc cttacttggt ctatttgaaa 6 0 
tctgactact tgccctgcgc tggagtcctg atccacccgc tttgggtgat cacagctgca 12 0 
cactgcaatt taccaaagct tcgggtgata ttgggggtta caatcccagc agactctaat 180 
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gaaaagcatc tgcaagtgat tggctatgag 
acttctattg atcatgacat catgctaatc 
tatgtgaaat tagccaacct gccctaccaa 
tctacctgga gctacaatgt gtacaaagag 
gtaatctcca agcctcagtg tcgcgatgcc 
ctgtgtgtgg gcattgtgcc aggaaggagg 
gcaatctgca atgggatgct tcaaggaatc 
gccgatgttg gcatctatgc caaaattttt 
caaaataac 



aagatgattc atcatccaca cttctcagtc 24 0 

aagctgaaaa cagaggctga actcaatgac 300 

actatctctg aaaataccat gtgctctgtc 3 60 

cccgattcac tgcaaactgt gaacatctct 42 0 

tataaaacct acaacatcac ggaaaatatg 480 

cagccctgca aggaagtttc tgctgccccg 54 0 

ctgtcttttg cggatggatg tgttttgaga 60 0 

tactatatac cctggattga aaatgtaatc 660 

669 



<210> 36 

<211> 669 

<212> DNA 

<213> Homo sapiens 



<400> 36 

aggtgggcag ccggggtgag ggtgccggcc 
tccactaacc catctgatta ccggatcctg 
gagcacagca agcagatgac agtgaataag 
caccgcatgg ggagtgacat caccctgctg 
cacatcctcc ccgcctgcct tccggaacca 
tggatatctg gttggggaat ggtcaccgag 
caggaggcag aggtcggtgt catggacaac 
taccccggcc agccaagcag cagtgactac 
gacctcataa caggaaaggc catttgccga 
ttaaatggca cctggttcct gatggggctg 
gtcggtccca gggtcttcac caggctcccc 
agggagagc 

<210> 37 
<211> 609 
<212> DNA 

<213> Homo sapiens 



cagcattcag aggagcctcc ccacaacagg 60 
cttgggtatg accagcaaag ccatcccaca 12 0 
atcatggtgc acgctgacta taacgagttg 18 0 
cagctgcacc atcatgtgga attcagctcc 240 
accacgtggc tggcccctga cagctcctgc 30 0 
gatgtcttcc tgcctgagcc cttccaactt 360 
actgtctgcg gatccttttt ccagccccag 420 
accatccacg aggacatgct gtgcgctggg 480 
cgagactcca ggggtcccct cgtctgccca 54 0 
tctagttgga gcctcgactg ctgctcaccc 600 
tact t caeca actggatcag ccagaagaag 660 

669 



<400> 37 

agagttgttt ctggatactt ttcagcaaac 
ttacatttta accactgcat tcatgatctg 
aaattccatc atactataaa gattatttgc 
tttttgcaaa tttcttccat ttggaatggg 
gtttcctaca agaatgetag tatttgtagg 
cttttcccca tgaactttcc cttgtgtgcc 
gagcacaccg agtttggcta ccaacccgaa 
ttcgaggagg gcaagaagga tgcctgcaag 
gtgggtcagt cgtggctgca ggcgggggtg 
aaccgcccag gtgtctacat ccgtgtcacc 
cccaaactg 

<210> 38 
<211> 705 
<212> DNA 

<213> Homo sapiens 



atggtttcta ctccctggag aacaggcatt 60 
agecaaacag tcctggggga tcatttagtt 12 0 
catatattag ateatgetgt ggcccttttg 18 0 
aacatttacc caatacctct acctgeattt 240 
atcatgttgt ggggacatgc tggggacatg 3 00 
cgcgtggaca gacaacaggg ggagcagtgc 36 0 
accatcaaga atgacatget gtgcgccggc 42 0 
ggegactegg gcggccccct ggtgtgcctc 4 80 
atcagctggg gtgagggctg tgcccgccag 54 0 
gcccaccaca actggatcca teggatcate 60 0 

609 



<400> 38 

catataatca atggtaaaag acagatagct 
tgtccacttt tgetatttet atecaatgea 
gactccaact caaaaaaaaa aaaaaaaaaa 
gttatcaaaa cagatggeca ctataaagaa 
aagatgtgga gaaattggaa ectttatgea 
attactgagc caggcagggt ggctcacgcc 
gggtggatca cgagatgggg gtctcactat 
ctgeaggaga tgcagctccc gctgatcctg 
atgtcctaca tcatgcccga catgctgtgt 



ttcccccgaa gaccaggaac aagagaagga 60 
cactgcactc cgccatgggc aacagagcaa 12 0 
gagacagaga aaacaattcc aaaagctaca 18 0 
aacaaaaaca gaaaacatca agtgttggca 24 0 
ctgttggttt tetgeaagat taaacataga 3 00 
tgtaatccca gcactttggg aggecgagge 360 
gttgeccagg ctggtgagac ctcagacgag 42 0 
gagccctggt gccacctgct ctacggacac 4 80 
gctggggaca tectgaatge taagaccgtg 54 0 
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tgtgagggcg actccggggg cccacttgtc tgtgaattca accgcagctg gttgcagatt 60 0 
ggaattgtga gctggggccg aggctgctcc aaccctctgt accctggagt gtatgccagt 660 
gtttcctatt tctcaaaatg gatatgtgat aacatagaaa tcacg 705 



<210> 39 
<211> 684 
<212> DNA 

<213> Homo sapiens 



<400> 39 
agggtctctg 
aggaagaggc 
ctgctgtgtg 

aggagacttg 

aaaatatttg 
agtcctgccg 
gtcgccgacc 
gctggctttc 
gagtttctga 
attgacagtt 
tgtggagatt 
attcaagaaa 



gaggtaggga 
ctgagaaccc 
ctaggcacag 
aatctcattg 
gaaaatacac 
tcatcactga 
agactgaatg 
tcaaggaagc 
atggaagagt 
gcaaggtaag 
gggttaggtc 
atatcaaaga 



cagtgtccca 
tcacatgtgt 
gattcaaaaa 
gtacaaaggg 
cagcaacata 
caaagtaatc 
ttacatcact 
ccagctccct 
caaatccact 
aaaagatcaa 
cccacatttt 
aaat 



tctttggtac 
ggaggtttcc 
tccatgacgt 
aaaagaaaga 
aattacgaca 
ccagcttgtc 
gactggggag 
gtgattgaga 
gagctctgtg 
gagaccaaag 
tatacataca 



catccaccaa 
tggcctcaaa 
ctgctcatag 
caaggagtaa 
taagtctgct 
tgccatcccc 
aaacccaagg 
atgaagtgtg 
ctgggcattt 
ttagtctttt 
tacacacata 



tgcctacaac 
cattgagcac 
gtcaaaggtt 
agagaaaagg 
gggtttggcc 
aaattatgtg 
tacctttggg 
caatcgctat 
ggctggaggc 
tggtatagga 
cacaccgtcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

684 



<210> 40 
<211> 756 
<212> DNA 

<213> Homo sapiens 



<400> 40 

agaaagctag gcatcttaaa ccaccaggta 
cattttattg gatataaatc atattccgaa 
atggatcccc gtccatcacg cagctatcag 
tccggcacgc tggtcagtaa aacccgtgcg 
aacgggatca ttgctcgcct gcagcaggca 
ctggatatgg atacgccggg tggaatggtg 
gcccgtatgc gcgatatcaa acccatctgg 
ggtcagctta ttgccagttc ggcatcgcga 
tccattgggg tcatgatggc gcacagtaac 
cacatgcaca catatgttta ttgcagcact 
caaatgccca tcaataatag actggataaa 
ctgtgcagtc ataaaaagga tgagttcatg 
accatcattc tcagcaaact aacacaggaa 



ctattttggt ataatctatc acttctgtta 60 
ccgctggcgc tgtttggtga ggatgatgac 12 0 
gtggcaaatg gtatcgcggt cttgccggtt 180 
cttcagcctt attccgggat gacgggttac 24 0 
atcagtgacc ccggcgttga cggcattctt 3 00 
tccggggcgt ttgactgcgc cgacattatt 3 60 
gcgctggcca atgacatgaa. ctgcagtgca 42 0 
cggctggtca cacaaacggc cagaaccggc 4 80 
tatggcgctg cgctcaaaac taacggcggg 54 0 
attcacaata gcaaagactt aaaaccaacc 6 00 
gaaaatgtgg cacatataca ccatggaata 66 0 
tcctttgcag gaacgtggat gaagctagaa 720 
cagaaa 756 



<210> 41 
<211> 897 
<212> DNA 

<213> Homo sapiens . 



<400> 41 

atgctggggg tgcttcttca aatctggagg 
ggccggaggg agagaagccg ccaggcagcc 
ctgttattgt atcttagagc tgaactggaa 
agactggtga tgagggggct tgtcctgggg 
cttccgaaag atgtgcgtgc tgatttccag 
gccgaaaagg tttccgctta taccggcatg 
gcagtattct ccggccagga atctttggat 
accgatgcgc tcggcgtgat gcgcgaagca 
ggaactatgc catcaccttc tgcatcagct 
actcagacaa ctgcatcagc tgaacaggcc 
gcagcccctg tagatgtcag tgcgcaggtt 
atcatgggca tcctgaactg cgacgaggct 
gccgaaacgc cgggtatgac ggtagagagc 



gggagctgga agaagcagac acaggcccag 6 0 
ggggctgtga gtgccggagg ccgacgcgca 12 0 
gacaaactgg cctgtgtgga cagcaggctt 180 
agggcctcag gcagctctgt tagacccaaa 2 40 
acgcgtatcg atgccactcg tcagatgttt 3 00 
tctgttcagg acgtgctgga caccgaagcg 3 60 
aacgggctgg cggatgaact tgttaacaat 420 
ctcgacagac gcaaaaaaac aacccttgga 4 80 
gtgaccacta agccagttga ccaggcagca 54 0 
actaccgttg acacgacaat tgcttccgta 6 00 
actgcagcag tagctgcaga gaatagtcgc 660 
aaagggcgtg agtcacaggc gcgagcactg 72 0 
gcacagcgca ttctggctgc tgcaccgcaa 7 80 
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agtgcccaga tgcgtaccga tacggcgctg gatcgtttga tggaaacagc acccggtgca 84 0 
ctccaagcag gtagcgcatc ttctgatgcc gctgacgatt tgttaaacac ccccgtt 8 97 

<210> 42 

<211> 1389 

<212> DNA 

<213> Homo sapiens 



<400> 42 

acggacccct 

atcctgcagg 

gatggcatcg 

gacttcaatg 

cacgggaccc 

ggggtcgctt 

gtcatcgagg 
tggggtcccg 
ttccggcgtg 
ggcaacggcg 
acgctttccg 
gcctccaccc 
acggacctgc 
gcggccggca 
cagcacctgg 
aacggcgtgg 
ctggtggaca 
gtccagagcc 
tgcgccggcc 
tcctacagcc 
acactcgtgg 
atgtccaccc 
aagggctact 
tctccaacc 



ggttctccaa 
cctggagtca 
agaaggacca 
actacgaccc 
gctgtgctgg 
tcaacgcccg 
cccagtcgct 
aggacgacgg 
gtgtgaccaa 
gcctgcacta 
tgggcagcac 
tcaccaccac 
atcacgggtg 
tgatcgccct 
tggtccgcgc 
ggcgccaagt 
ccgcccgcac 
gccccacccc 
tccacaactc 
ggcgcggaga 
ccatacgacc 
acttctggga 
atttcaacac 



gcagtggtac 
ggggctgtca 
cccggacctc 
ggacccccag 
ggaggtggcc 
aatcggaggc 
gagcctgcag 
ccgcacggtg 
gggccgcggc 
cgacaactgc 
cacccagcag 
ctacagcagc 
cacagaccag 
agcgctggag 
gtccaagccg 
gagccatcac 
ctggctgccc 
catcctgccg 
catccgctcg 
cctggagatc 
cttggacgtc 
tgagaaccca 
gggtgagggc 



atgaacagcg 
ggccagggca 
tgggccaact 
ccccgctaca 
gcgatggcca 
gtacggatgc 
ccgcagcaca 
gacggccccg 
gggctgggca 
aactgcgacg 
ggccgcgtgc 
ggcgtggcca 
cacacgggca 
gccaacccgt 
gcgcacctgc 
tacggatacg 
acccagccgc 
ctgatctaca 
ctggagcacg 
tcgctcacca 
agcactgaag 
cagggcgtgt 
ggggcggggc 



aggcccaacc 
tcgtggtctc 
acgaccccct 
cccccagcaa 
acaatggctt 
tggacggtac 
tccacattta 
gcatcctcac 
cgctcttcat 
gctacaccaa 
cctggtacag 
ccgaccccca 
cctcggcctc 
tcctgacgtg 
aggccgagga 
ggctgctgga 
agaggaagtg 
tcagggaaaa 
tgcaggcgca 
gccccatggg 
gctacaacaa 
ggaccctggg 
tgtggtgggc 



agacctgagc 
tgtgctggac 
ggccagctat 
agagaaccgg 
ctgtggtgtg 
catcaccgat 
cagcgccagc 
ccgcgaggcc 
ctgggcctcg 
cagcatccac 
cgaagcctgc 
gatcgtcacc 
agccccactg 
gagagacatg 
ctggaggacc 
cgccgggctg 
cgccgtccgg 
cgtatcggcc 
gctgacgctg 
cacgcgctcc 
ctgggtcttc 
cctagagaac 
ggggcttggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1389 



<210> 43 

<211> 675 

<212> DNA 

<213> Homo sapiens 



<400;> 43 

atggcgtctc 

gttgaatatg 

aatatagttg 

gtgaggaaaa 

gatgctagag 

gaggacccag 

acaaagagtt 

agctgggatt 

aacaaacaaa 

aagaaagtaa 

accaaagaag 

gcagacacca 



gatatgacag 
cccaggaagc 
ttcttggggt 
tttgtgccct 
tagtaataaa 
tcactgtaga 
atttgaagtt 
accggcacat 
aatttgcagc 
tcccatttat 
tgaaagattt 
aaaaa 



ggcgatcact 
ggtgaagaaa 
agaaaaaaaa 
tgatgaccat 
cagagcccgt 
atacataact 
ttccagagaa 
gccaccacac 
atttctatat 
gatagctaca 
ccacaatgaa 



gtcttctccc 
ggatccaccg 
tctgttgcca 
gtctgcatgg 
gtggagtgcc 
cgcttcatag 
gtaccttttt 
ctggctaact 
gccaacaatg 
aataaaatta 
aactataaaa 



cagacggaca 
cggtcggaat 
agcttcaaga 
cttttgcagg 
agagccataa 
caactttaaa 
tgttttgttt 
tttttgcagg 
aacaatctga 
aatgcataga 
cactgatgca 



cctttttcaa 
tcgaggtacc 
tgaaagaact 
acttactgct 
gcttacggtt 
gcagattaat 
tttgtttttt 
atacaaaatc 
aaaagaaatc 
aataaactta 
agaaactgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

675 



<210> 44 

<211> 684 

<212> DNA 

<213> Homo sapiens 

<400> 44 

tcaaaaggag gaatttcagt gggtctctgt gttcgggatg gggtggtggt ggtaagtaga 6 0 
gatactaaca gccctcacag agttactcct ctgctaaatg aactaatgtg tctcaggtgt 12 0 
tctgggctgg cagcagctgc gaagatggtt gcagcattca tctctctgag gagatcagca 180 
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gagataaata agtatgttat 
atgtccttga taaaaataaa 
gccttaattg taggttttga 
ggtacttatc atgcttggaa 
tttctagaaa agaattacac 
gcaataaaag ctttgctaga 
ataagaagaa atcaaccttt 
gaagaagaag gaggagaaga 
gaggtggaag aggaagagga 

<210> 45 

<211> 3015 

<212> DNA 

<213> Homo sapiens 

<400> 45 

gagaatggaa gtcttacatg gcaagaatta ctcagacaaa cagggaaatg ctcaataccc 60 
tgtctaatcg atacgggcgc tcaagcaaat attataacag aagaaactgt tcgagcacat 12 0 
aaactgccta ccagaccctg gtcaaaaagt gtgatatatg gtggagttta tccaaataag 18 0 
attaatcgca aaacaataaa acttaacata agtctaaatg gaatatcaat caaaacagaa 24 0 
ttcttggttg taaagaaatt ttcgcatcca gctgctatct ccttcacaac attatatgac 300 
aataacattg aaatatctag cagtaaacac acgctctctc aaatgaacaa agtttcaaat 3 60 
attgtcaagg aacctgagtt accagatatc tataaagaat tcaaagacat tactgcagaa 42 0 
accaatacgg aaaagctacc aaagccaata aaagggttag aatttgaagt tgaactaact 48 0 
caagaaaact acagattacc tatcagaaat tacccgctac caccgggaaa aatgcaagct 54 0 
atgaatgatg aaattaatca aggattaaaa agtggaatta tacgagaatc taaagccatt 60 0 
aacgcctgtc cagtaatgtt cgttccgaaa aaggaaggca ccttgagaat ggtggttgac 66 0 
tacaaacctt taaataagta tgtcaaaccc aatatatatc cgttaccact tattgaacaa 72 0 
ttacttgcta aaatacaagg ttctacaatt tttactaaac ttgacctcaa aagtgcctat 7 80 
cacttgatac gagtaagaaa aggagatgaa cataaacttg cttttcgctg tcctcgtgga 84 0 
gtttttgaat atctagtaat gccttatggc atatctatag ctccagcaca ttttcaatac 900 
tttatcaata caatacttgg tgaagtcaaa gaatcgcatg tagtatgtta tatggataat 960 
attttaattc attcaaaatc ggaatctgaa catgtaaaac atgttaaaga cgttctacag 1020 
aaattaaaaa atgcgaactt aattatcaat caagcaaaat gtgaatttca tcaatcacaa 1080 
gtaaaattta tagggtatca catttcggaa aaaggattta cgccttgtca agaaaatata 1140 
gacaaagtct tacaatggaa gcaacctaag aatcgtaaag aattacgaca atttctaggt 12 00 
tctgtcaatt atcttaggaa attcattcca aagacatcac aattaacaca tccactcaat 12 60 
aatcttttga aaaaggatgt acgctggaaa tggacaccaa cacaaaccca agcgatagaa 132 0 
aacattaaac aatgtttagt ttcgcctccg gtgctacgac actttgattt cagtaaaaag 13 80 
attctactag aaactgatgc ttcagatgtc gctgtaggag ccgtattgtc tcaaaaacat 1440 
gatgatgata aatactatcc tgttggatac tattcagcaa agatgtctaa agcacaatta 1500 
aattatagcg tatcggacaa agaaatgctt gcaatcatta agtctctcaa acattggaga 1560 
cactatttag aatccactat cgaacctttc aaaattttaa cagaccatcg aaacttaatt 1620 
ggtcgcatta ctaacgaatc cgagcctgaa aacaaacgtt tagctcgttg gcaattattt 16 80 
ttacaagact tcaactttga aattaactac agacctggat cagcaaatca catagctgat 174 0 
gccttatcca gaattgttga cgaaacagaa ccaattccaa aagattcaga agacaatagt 18 00 
atcaactttg ttaatcaaat ctcgataacc gatgatttta aaaaccaagt ggttacagaa 18 60 
tatacgaatg atacaaaatt gttgaattta ctaaacaatg aagacaaacg agtggaagag 192 0 
aatatccaac tcaaagatgg cttactaatt aacagtaaag accaaatctt attacctaat 1980 
gatactcagc tgactaggac aattattaaa aagtatcatg aagaaggtaa attgattcat 2 040 
ccaggcattg aacttcttac aaacattata ttacgtagat ttacgtggaa aggaataaga 2100 
aaacaaatac aagaatatgt acagaactgc catacatgtc aaataaacaa atctaggaat 2160 
cataaacctt atggaccttt acaaccaatt cccccatcag aaagaccttg ggaatcttta 2 220 
tcaatggatt ttattacagc tttaccagaa tcatctggtt ataatgcact tttcgtggta 22 80 
gttgaccgat tttcaaaaat ggcaatctta gtaccttgta cgaaatccat tacagcagag 2340 
caaacagctc gaatgtttga tcaacgagtt attgcttatt tcggcaatcc aaaagaaatc 2 4 00 
attgcagata atgatcatat ttttacttcc caaacgtgga aagatttcgc acataaatat 24 60 
aatttcgtta tgaaafctttc gttaccatac agaccacaaa ctgatggaca aactgagcgt 2520 
acaaaccaaa ctgtggagaa attactaaga tgtgtatgta gcacacatcc aaatacatgg 2 580 
gtagatcata tatccctagt gcaacaatct tacaacaatg cgatacattc agcaactcaa 2 64 0 
atgacacctt ttgagatagt acatcgctat tcaccagctt tatcaccttt agagttacct 2700 
agctttagtg acaaaactga cgaaaactct caggaaacga tccaagtatt tcaaacagtt 2 7 60 



atatccaaga gatgtatgca 
atatacccaa agcaatggac 
tgatgatggt atctcaagat 
ggcaaatgca ataggccgaa 
agaagatgcc atagcaagtg 
agttgtccag tctggtggaa 
gaagaagaaa gaggaggagg 
agaagaggag gaggaggaag 
agaa 



ccccttatat agtgaacaga 24 0 
gaagaccttt tggtatttct 300 
tgtatcagac agatccttct 360 
gtgctaaaac tgttcgagaa 42 0 
acagtgaagc tatcaagtta 480 
aaaacattga acttgctata 540 
aggaaagaag gaagaagaaa 60 0 
aggatgagga ggaagaggag 660 

684 
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aaagaacact tgaatacaaa caacataaag atgaaaaagt atttcgatat gaaaatacaa 2820 
gaaattgaag aatttcaacc tggagaccta gttatggtca aaagaacgaa aacagcattc 2 8 80 
ttatacacca ataacagaca aacagagagc caaatcatga gtgaactccc attcacaatt 2 94 0 
gcttcaaaga gaataaaata cctaggaatt caactgacaa gggaagtgaa ggacctcttc 3 000 
aaggagaact acaaa 3 015 

<210> 46 
<211> 585 
<212> DNA 

<213> Homo sapiens 



<400> 46 

ggccccagac 

gctgcagctg 

gtcatccctg 

acctggtatc 

cccagtgtgg 

gatctctgtg 

gtctatagcg 

tatgcctacg 

gccctggctc 

gtagaccttt 



tggcccacgg 
acacgcgttc 
tgcaccagca 
gggtattaca 
ccagtgctgc 
tggccactgc 
acggcacccg 
gcgtgctaga 
gctgcgccgt 
tccacgtgcg 



caccaccact 
ctcctgtggc 
cctcctgggt 
gcgggagctg 
caagctcttg 
cctctgcggc 
cctgcagggg 
ccgtggctat 
ggcccacgcc 
ggagagtgga 



cfcggccttcc 
agctatgtgg 
accacctctg 
cggcttcggg 
tcagccatga 
tgggaccgct 
gacatcttct 
cgctacgaca 
acccaccgtg 
tgggagcatg 



gcttccgtca 
cgtgtccagc 
gcacctctgc 
aactgaggga 
tgtctcaata 
ctggccctga 
ctgtgggctc 
tgagcaccca 
atgcctattc 
tgtca 



tggagtcatt 
ctcatgcaag 
cgactgtgct 
gggtcagctg 
ccggggactg 
gctcttctac 
tggatctccc 
ggaagcctac 
agggggctct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

585 



<210> 47 
<211> 594 
<212> DNA 

<213> Homo sapiens 



<400> 47 

tctattatgt 

atcgctgcag 

atctttccca 

agagttgccc 

atcaaacctt 

ccctactaca 

tgctctctag 

tatgccgaac 

cacccgtttg 

ggcatgggag 



cctataacgg 
acaggcactt 
tgggtggttg 
agtgcctcaa 
ataccttcac 
ctgatccagt 
acctcatcgg 
aaatgtacgg 
aaaccatctc 
tcatcatcca 



aggagccatc 
cgggatccag 
gttgtacatc 
gttccagctg 
gagcatggtg 
cattgctggt 
cttccccatg 
aatgtgtfgag 
cccagccatg 
catcaccaag 



atggccatga 
gcccagatgg 
ggtctggccg' 
aacctatatg 
gccaacttct 
ttggacctga 
gtgactgatg 
tccctctggg 
ctgaatgctg 
aaggacaaaa 



aggggaagaa 
tgaccacgga 
ggcttgccac 
agttgaagga 
tgtatgagaa 
agacctttaa 
actttgtggt 
aacccaacat 
tggactgggg 
tcaccaccag 



ccgtgtggcc 
cttccaggag 
tgacgtccag 
aggtcagcag 
acattttggc 
gcccttcagt 
caatggcagc 
ggatccagaa 
tgcagggtca 
gaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

594 



<210> 48 
<211> 361 
<212> PRT 

<213> Homo sapiens 



<400> 48 
Leu Val Asp 
1 

Gly Thr lie 

Asp Thr Gly 
35 

Leu Ala Cys 
50 

Tyr Gin Ser 
65 

Met Thr Gly 

Asp Thr Asn 

Leu Tyr Tyr 
115 



Glu Gin 
5 

Gly He 
20 

Ser Ser 

Thr Asn 

Thr Ser 

He Leu 

85 
Gin He 
100 

Ala Pro 



Pro 

Gly 

Asn 

His 

Glu 

70 

Gly 

Phe 

Phe 



Leu Glu Asn 
Thr 
Leu 



Asn 

55 

Thr 

Tyr 

Gly 

Asp 



Pro Ala 
25 

Trp Val 
40 

Arg Phe 



Val Ser 

Asp Thr 

Leu Ser 
105 
Gly He 
12 0 



Tyr 


Leu 


Asp 


Met 


Glu 


Tyr 


Phe 


10 










15 




Gin 


Asp 


Phe 


Thr 


Val 


Val 


Phe 








30 






Pro 


Ser 


Val 


Tyr 
45 


Cys 


Ser 


Ser 


Asn 


Pro 


Glu 
60 


Asp 


Ser 


Ser 


Thr 


He 


Thr 


Tyr 


Gly 


Thr 


Gly 


Ser 




75 






80 


Val 


Gin 


Val 


Gly 


Gly 


He 


Ser 


90 










95 




Glu 


Thr 


Glu 


Pro 


Gly 
110 


Ser 


Phe 


Leu Gly Leu 


Ala 


Tyr 


Pro 


Ser 



125 
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He 


Ser 


Ser Ser 


Gly Ala 


Thr 


Pro 


Val 


Phe Asp Asn 


He 


Trp 


Asn 


Gin 




130 








135 






14 0 










Gly 


Leu 


Val Ser 


Gin 


Asp 


Leu 


Phe 


Ser 


Val Tyr Leu 


Ser Ala Asp Asp 


145 








150 








155 








160 


Lys 


Ser 


Gly Ser 


Val 


Val 


He 


Phe Gly Gly He Asp 


Ser 


Ser 


Tyr 


Tyr 








165 










170 






175 




Thr 


Gly 


Ser Leu 


Asn 


Trp 


Val 


Pro 


Val 


Thr Val Glu 


Gly Tyr 


Trp 


Gin 




180 










185 






190 






He 


Thr 


Val Asp 


Ser 


He Thr Met Asn Gly Glu Thr 


He 


Ala 


Cys 


Ala 






195 








200 






205 








Glu 


Gly 


Cys Gin 


Ala 


He 


Val Asp Thr Gly Thr Ser 


Leu Leu Thr Gly 




210. 








215 






220 










Pro 


Thr 


Ser Pro 


He 


Ala 


Asn 


He 


Gin 


Ser Asp He 


Gly Ala 


Ser 


Glu 


225 








230 








235 








240 


Asn 


Ser Asp Gly 


Asp 


Val 


Ser 


Pro 


Ala 


Pro Thr Ala 


Leu 


Phe 


Tyr 


Thr 








245 










250 






255 




Gin 


Val 


Val Gly 


Val 


Pro Gly Arg 


Ser 


Asp Glu Asn 


Pro 


Ser 


Asn 


Phe 






260 










265 






270 






Ser- 


His 


Pro His 
275 


Ser 


Phe 


Gin 


Met 
280 


Val 


Val Ser Cys 


Ser 
285 


Ala 


He 


Ser 


Ser 


Leu 


Pro Asp 


He 


Val 


Phe 


Thr 


lie 


Asn Gly Val 


Gin 


Tyr 


Pro 


Val 




290 






295 






300 










Pro 


Pro 


Ser Ala 


Tyr 


He 


Leu 


Gin 


Ser Glu Gly Ser 


Cys 


He 


Ser 


Gly 


305 






310 








315 








320 


Phe 


Gin Gly Met 


Asn 


Val 


Pro 


Thr 


Glu 


Ser Gly Glu 


Leu 


Trp 


He 


Leu 








325 










330 






335 




Gly Asp Val Phe 


He 


Arg 


Gin 


Tyr 


Phe 


Thr Val Phe 


Asp 


Arg 


Ala 


Asn 






340 










345 






350 






Asn Gin Val Gly 


Leu 


Ala 


Pro 


Val 


Ala 













355 360 



<210> 49 

<211> 361 

<212> PRT 

<213> Homo sapiens 



<400> 49 



Leu 


Val 


Asp 


Glu 


Gin 


Pro Leu Glu 


Asn 


Tyr 


Leu Asp 


Met Glu 


Tyr 


Phe 


1 






5 






10 








15 




Gly 


Thr 


He 


Gly 


He 


Gly Thr Pro 


Ala 


Gin Asp 


Phe 


Thr Val 


Leu 


Phe 






20 






25 








30 






Asp 


Thr 


Gly 


Ser 


Ser 


Asn Leu Trp 


Val 


Pro 


Ser 


Val 


Tyr Cys 


Ser 


Ser 






35 






40 










45 






Leu 


Ala 


Cys 


Thr 


Asn 


His Asn Arg 


Phe 


Asn 


Pro 


Glu 


Asp Ser 


Ser 


Thr 




50 






55 








60 








Tyr 


Gin 


Ser 


Thr 


Ser 


Glu Thr Val 


Ser 


He 


Thr 


Tyr 


Gly Thr Gly 


Ser 


65 










70 






75 








80 


Met 


Thr 


Gly 


He 


Leu 


Gly Tyr Asp 


Thr 


Val 


Gin 


Val 


Gly Gly 


He 


Ser 










85 






90 








95 




Asp 


Thr 


Asn 


Gin 


He 


Phe Gly Leu. 


Ser 


Glu 


Thr 


Glu 


Pro Gly 


Ser 


Phe 






100 






105 








110 






Leu 


Tyr 


Tyr 


Ala 


Pro 


Phe Asp Gly 


He 


Leu 


Gly Leu 


Ala Tyr 


Pro 


Ser 






115 






120 










125 






He 


Ser 


Ser 


Ser 


Gly Ala Thr Pro 


Val 


Phe 


Asp 


Asn 


He Trp 


Asn 


Gin 




13 0 








135 








14 0 








Gly 


Leu 


Val 


Ser 


Gin 


Asp Leu Phe 


Ser 


Val 


Tyr 


Leu 


Ser Ala Asp Asp 


145 










150 






155 








160 


Lys 


Ser 


Gly 


Ser 


Val 


Val He Phe 


Gly Gly 


He 


Asp 


Ser Ser 


Tyr 


Tyr 








165 






170 








175 




Thr 


Gly 


Ser 


Leu 


Asn 


Trp Val Pro 


Val 


Thr 


Val 


Glu 


Gly Tyr 


Trp 


Gin 






180 






185 








190 
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lie 


Thr 


Val 


Asp 


Ser 


He 


Thr Met Asn Gly Glu Thr 


He 


Ala 


Cys 


Ala 






195 








200 


205 








Glu 


Gly 


Cys 


Gin 


Ala 


He Val Asp Thr Gly Thr Ser Leu Leu Thr Gly 




210 










215 220 










Pro 


Thr 


Ser 


Pro 


lie 


Ala 


Asn He Gin Ser Asp He 


Gly Ala 


Ser 


Glu 


225 










230 


235 








240 


Asn 


Ser 


Asp 


Gly 


Asp 


Val 


Ser Pro Ala Pro Thr Ala 


Leu 


Phe 


Tyr 


Thr 










245 




250 






255 




Gin 


Val 


Val 


Gly 


Val 


Pro 


Gly Arg Ser Asp Glu Asn 


Pro 


Ser 


Asn 


Phe 








260 






265 




270 






Ser 


His 


Pro 


His 


Ser 


Phe 


Gin Met Val Val Ser Cys 


Ser 


Ala 


He 


Ser 






275 








2B0 


285 








Ser 


Leu 


Pro 


Asp 


He 


Val 


Phe Thr He Asn Gly Val 


Gin 


Tyr 


Pro 


Val 




290 








295 300 










Pro 


Pro 


Ser 


Ala 


Tyr 


He 


Leu Gin Ser Glu Gly Ser 


Cys 


He 


Ser Gly 


305 










310 


315 








320 


Phe 


Gin 


Gly 


Met 


Asn 


Val 


Pro Thr Glu Ser Gly Glu 


Leu 


Trp 


He 


Leu 










325 




330 






335 




Gly 


Asp 


Val 


Phe 


He 


Arg 


Gin Tyr Phe Thr Val Phe 


Asp 


Arg 


Ala 


Asn 








340 






345 




350 






Asn 


Gin 


Val 


Gly 


Leu 


Ala 


Pro Val Ala 














355 








360 











<210> 50 
<211> 305 
<212> PRT 

<213> Homo sapiens 



<400> 50 



Asp 


Lys 


Ala 


Trp 


Leu 


Lys 


Arg Gly Asn Lys Gin Phe Asn Glu Gly Lys 


1 








5 




10 




15 




Glu 


Ser 


Asp 


Arg 


Cys 


Leu 


He Phe Lys Cys Lys Asn Lys 


Asp 


Val 


Lys 








20 






25 


30 






Met 


He 


Glu 


Gin 


His 


Asn 


Gin Glu Tyr Ser Gin Gly Lys 


His 


Ser 


Phe 






35 








40 * 45 








Thr 


Met 


Ala 


Met 


Asn 


Ala 


Phe Gly Asp Met Thr Asn Glu 


Glu 


Phe 


Arg 




50 










55 60 








Gin 


Val 


Met 


Asn 


Gly 


Phe 


Gin Tyr Gin Lys His Arg Lys 


Gly Lys 


Gin 


65 










70 


75 






80 


Phe 


Gin 


Glu 


Arg 


Leu 


Leu 


Leu Glu He Pro Thr Ser Val 


Asp 


Trp 


Arg 










85 




90 




95 




Glu 


Lys 


Gly 


Tyr 


Met 


Thr 


Pro Val Lys Asp Gin Gly Gin 


Cys 


Gly 


Ser 






100 






105 


110 






Cys 


Trp 


Ala 


Phe 


Ser 


Ala 


Thr Gly Ala Leu Glu Gly Gin 


Met 


Phe 


Trp 






115 








120 125 








Lys 


Thr Gly 


Lys 


Leu 


He 


Ser Leu Asn Glu Gin Asn Leu 


Val 


Asp 


Cys 


130 










135 140 








Ser 


Gly Pro 


Gin 


Gly 


Asn 


Glu Gly Cys Asn Gly Asp Phe 


Met 


Asp 


Asn 


145 










150 


155 






160 


Pro 


Phe 


Arg 


Tyr 


Val 


Gin Glu Asn Gly Gly Leu Asp Ser 


Glu 


Ala 


Ser 










165 




170 




175 




Tyr 


Pro Tyr 


Glu 


Gly Lys 


Val Lys Thr Cys Arg Tyr Asn 


Pro 


Lys 


Tyr 








180 






185 


190 






Ser 


Ala 


Ala 


Asn 


Asp 


Thr Gly Phe Val Asp He Pro Ser 


Arg 


Glu 


Lys 






195 








200 205 








Asp 


Leu 


Ala 


Lys 


Ala 


Val 


Ala Thr Val Gly Pro He Ser 


Val 


Ala 


Val 


210 








215 220 








Gly 


Ala 


Ser 


His 


Val 


Phe 


Phe Gin Phe Tyr Lys Lys Gly 


He 


Tyr 


Phe 


225 










230 


235 






240 


Glu 


Pro Arg 


Cys 


Asp 


Pro Glu Gly Leu Asp His Ala Met 


Leu 


Val 


Val 










245 




250 




255 
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Gly Tyr Ser Tyr Glu Gly Ala Asp 
260 

Val Lys Asn Arg Glu Ser Phe Phe 
275 280 
Ala Leu Asn His lie Ser Glu lie 
290 295 

Gin 
305 



Ser Asp Asn Asn Lys Tyr Trp Leu 
265 270 
Ala lie Ser He Tyr He Pro Gin 
285 

Pro Glu Val Phe Phe Pro Leu Arg 
300 



<210> 51 

<211> 190 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Ser Asn Val Lys Pro Arg Lys Lys Asn Pro Lys Gly Phe Thr Val Asn 

1 5 10 15 

Val Leu Ala Asn He Gly Val He Asn His Trp Lys Leu Pro Lys Leu 

20 25 30 

Thr His Lys Gly Glu Lys Asn Ser Gly Lys Glu Gly Glu Lys Pro His 

35 40 45 

Lys Thr Asn Ser Ala Ser Asn Ser Cys Gly Thr He Ser Asn Leu Gly 

50 55 60 

Ser He Glu Asn Gly Thr Asn Glu Gly Lys Tyr Lys He Asn Phe He 
65 70 75 80 

Leu Lys Val He Leu Lys Asn Ala Asn Thr Val Arg Thr Glu Lys Arg 

85 90 95 

Asn Arg Gin He His Asn Tyr Asn Cys Leu His Cys Thr Val Glu Thr 

100 105 HO 

Ser Asp Tyr Gin Val Gin Ala Pro Ser He Asp Glu Lys Val Asp Leu 

115 120 125 

His Phe He Ala Leu Val His Val Asp Gly His Leu Tyr Glu Leu Asp 

130 135 140 

Gly Arg Lys Pro Phe Pro He Asn His Gly Glu Thr Ser Asp Glu Thr 
145 * 150 155 160 

Leu Leu Glu Asp Ala He Glu Val Cys Lys Lys Phe Met Glu Arg Asp 

165 170 175 

Pro Asp Glu Leu Arg Phe Asn Ala He Ala Leu Ser Ala Ala 
180 185 190 



<210> 52 
<211> 204 
<212> PRT 

<213> Homo sapiens 



<400> 52 








His 




Val 




Ala 


Cys 


Leu 


Val 


Ser Gly Phe Gly Pro Phe Arg 


Gin 


Leu 


Asn 


1 






5 * 10 








15 




Ser 


Ser 


Trp 


Glu 


Ala Val Lys Glu Leu Ser Lys 


Leu 


Gly 


Leu 


Gly Asn 






20 


25 






30 






Glu 


Thr 


Val 


Val 


Gin Leu Arg Thr Leu Glu Leu 


Pro 


Val 


Asp 


Tyr 


Arg 






35 




40 




45 






Gin 


Glu 


Ala 


Lys 


Arg 


Arg Val Thr Gly He Trp Glu 


Asp 


His 


Gin 


Pro 




50 




55 


60 










Leu 


Val 


Val 


His 


Val Gly Met Asp Thr Ala Ala Lys 


Ala 


He 


He 


Leu 


65 








70 75 






He 




80 


Glu 


Gin 


Ser 


Gly 


Lys Asn Gin Gly Tyr Arg Asp 


Ala 


Asp 


Arg 


Ser 








85 90 








95 




Phe 


Trp 


Pro 


Glu 


Gly Gly Val Cys Leu Pro Gly 


Ser 


Pro 


Asp 


Val 


Leu 






100 


105 






110 
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Glu 


Ser 


Gly Val 


Cys 


Met 


Lys Ala 






115 








12 0 


Gly 


Val 


Asp 


Val 


He 


Phe 


Ser Arg 




130 










135 


Tyr 


Thr 


Tyr 


Tyr 


Leu 


Ser 


Leu His 


145 










150 




He 


His 


Val 


Pro 


Pro 


Leu 


Ser Arg 










165 






Arg 


Ala 


Leu 


Arg 


Val 


He 


He Gin 








180 








Lys 


Gin 


Lys 


Glu 


Val 


Thr 


Ala Ser 






195 








200 



Val Cys Lys Arg Val Ala Val Glu 
125 

Asp Ala Gly Arg Tyr Val Cys Asp 
140 

His Gly Lys Gly Cys Ala Ala Leu 
155 160 
Gly Leu Pro Ala Ser Leu Leu Gly 

170 175 
Glu Met Leu Glu Glu Ala Gly Glu 
185 190 
Gly Thr Ser His 



<210> 53 

<211> 358 

<212> PRT 

<213> Homo sapiens 



<400> 53 
Arg Lys Lys Ser 
1 

Gly Asn Thr Cys 
20 

Pro Leu Leu Lys 
35 

Thr Ser Pro Ser 
50 

Ala Met Tyr Ser 
65 

His Leu He Trp 

Asp Ala His Glu 
100 

Ser Lys Asp Asp 
115 

Asn Cys He He 
130 

Thr Cys Gin Ala 
145 

Asp He Ser Leu 

Asn Pro Glu Arg 
180 

Gly He Pro Ser 
195 

His Leu Gly Ser 
210 

Gin Glu Ser Thr 
225 

Cys Phe His Leu 

He Asn Thr Phe 
260 

Leu Ala Ser Thr 
275 

Asp Cys Val Pro 
290 

His His Gly Thr 
305 

Gin Lys Asp Gin 



Val Tyr Thr Val 
5 

Phe Met Asn Cys 

Asp Phe Phe Leu 
40 

Leu Cys Leu Val 
55 

Gly Ser Arg Thr 
70 

He His Ala Glu 
85 

Phe Leu He Ala 

Ser Gly Gly Gin 
120 

Asp Gin He Phe 
135 

Cys His Ser Val 
150 

Asp Leu Pro Gly 
165 

Ala Asp Ser Thr 

Leu Thr Asp Cys 
200 

Ser Ala Lys He 
215 

Lys Gin Leu Thr 
230 

Lys Arg Phe Glu 
24 5 

He Ser Phe Pro 

Lys Glu Ser Arg 
280 

Asn Glu Asn Lys 
295 

Leu Glu Ser Gly 
310 

Trp Phe Ser Cys 
325 



Gly Leu Arg Gly 
10 

He Val Gin Ala 
25 

Ser Asp Lys His 

Cys Glu Met Ser 
60 

Pro His He Pro 
75 

His Leu Ala Gly 
90 

He Leu Asp Val 
105 

Glu Ala Asn Asn 

Thr Gly Gly Leu 
14 0 

Ser Thr Thr He 
155 

Ser Cys Ala Thr 
170 

Val Ser Arg Asp 
185 

Leu Gin Trp Phe 

Lys Cys Asn Ser 
22 0 

Met Lys Lys Leu 
235 

His Val Gly Lys 
250 

Leu Glu Leu Asp 
265 

Met Lys Glu Gly 

Tyr Ser Leu Phe 
300 

His Tyr Thr Ser 
315 

Asp Asp Ala He 
330 



Leu He Asn Leu 
15 

Leu Thr His He 
30 

Lys Cys He Met 
45 

Ser Leu Phe His 

Tyr Lys Leu Leu 
80 

Tyr Arg Gin Gin 
95 

Leu His Arg His 
110 

Pro Asn Cys Cys 
125 

Gin Ser Asp Val 

Asp Pro Cys Trp 
160 

Phe Asp Ser Gin 
175 

Asp His He Pro 
190 

Thr Arg Pro Glu 
205 

Cys Gin Ser Tyr 

Pro He Val Ala 
240 

Gin Arg Arg Lys 
255 

Met Thr Pro Phe 
270 

Gin Pro Pro Thr 
285 

Ala Val He Asn 

Phe He Arg Gin 
320 

He Thr Lys Ala 
335 
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Thr lie Glu Asp Leu Leu Tyr Ser Glu Gly Tyr Leu Leu Phe Tyr His 

340 345 350 

Lys Gin Gly Leu Glu Lys 
355 



<210> 54 

<211> 358 

<212> PRT 

<213> Homo sapiens 

<400> 54 

Arg Lys Lys Ser Val Tyr Thr Val Gly Leu Arg Gly Leu He Asn Leu 

15 10 15 

Gly Asn Thr Cys Phe Met Asn Cys He Val Gin Ala Leu Thr His lie 

20 25 30 

Pro Leu Leu Lys Asp Phe Phe Leu Ser Asp Lys His Lys Cys He Met 

35 40 45 

Thr Ser Pro Ser Leu Cys Leu Val Cys Glu Met Ser Ser Leu Phe His 

50 55 60 

Ala Met Tyr Ser Gly Ser Arg Thr Pro His He Pro Tyr Lys Leu Leu 
65 70 75 80 

His Leu He Trp He His Ala Glu His Leu Ala Gly Tyr Arg Gin Gin 

85 90 95 

Asp Ala His Glu Phe Leu lie Ala He Leu Asp Val Leu His Arg His 

100 105 HO 

Ser Lys Asp Asp Ser Gly Gly Gin Glu Ala Asn Asn Pro Asn Cys Cys 

115 120 125 

Asn Cys He He Asp Gin He Phe Thr Gly Gly Leu Gin Ser Asp Val 

130 135 140 

Thr Cys Gin Ala Cys His Ser Val Ser Thr Thr He Asp Pro Cys Trp 
145 150 155 160 

Asp He Ser Leu Asp Leu Pro Gly Ser Cys Ala Thr Phe Asp Ser Gin 

165 170 175 

Asn Pro Glu Arg Ala Asp Ser Thr Val Ser Arg Asp Asp His He Pro 

180 185 190 

Gly He Pro Ser Leu Thr Asp Cys Leu Gin Trp Phe Thr Arg Pro Glu 

195 200 205 

His Leu Gly Ser Ser Ala Lys He Lys Cys Asn Ser Cys Gin Ser Tyr 

210 215 220 

Gin Glu Ser Thr Lys Gin Leu Thr Met Lys Lys Leu Pro He Val Ala 
225 230 235 240 

Cys Phe His Leu Lys Arg Phe Glu His Val Gly Lys Gin Arg Arg Lys 

245 " 250 255 

He Asn Thr Phe He Ser Phe Pro Leu Glu Leu Asp Met Thr Pro Phe 

260 265 270 

Leu Ala Ser Thr Lys Glu Ser Arg Met Lys Glu Gly Gin Pro Pro Thr 

275 280 285 

Asp Cys Val Pro Asn Glu Asn Lys Tyr Ser Leu Phe Ala Val He Asn 

290 295 300 

His His Gly Thr Leu Glu Ser Gly His Tyr Thr Ser Phe He Arg Gin 
305 310 315 320 

Gin Lys Asp Gin Trp Phe Ser Cys Asp Asp Ala He He Thr Lys Ala 

325 330 335 

Thr He Glu Asp Leu Leu Tyr Ser Glu Gly Tyr Leu Leu Phe Tyr His 

340 345 350 

Lys Gin Gly Leu Glu Lys 
355 



<210> 55 
<211> 357 
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<212> PRT 

<213> Homo sapiens 



<400> 55 
Arg lie Thr Ser 
1 

Gly Asn Thr Cys 
20 

Pro lie Leu Arg 
35 

Pro Ser Pro Glu 
50 

Glu Leu Tyr Ser 
65 

His Leu Val Trp 

Asp Ala His Glu 
100 

Cys Lys Gly Asp 
115 

Asn Cys lie lie 
130 

Thr Cys Gin Ala 
145 

Asp He Ser Leu 

Ser Pro Gly Arg 
180 

He Thr Thr Leu 
195 

Leu Gly Ser Ser 
210 

Glu Ser Thr Lys 
225 

Phe His Phe Lys 

Thr Thr Tyr He 
260 

Ala Ser Ser Lys 
275 

Asn Ser Gly Asn 
290 

His Gin Gly Thr 
305 

His Lys Asp Gin 

Ser He Lys Asp 
340 

Lys Gin Val Leu 
355 



Ser Phe Thr He 
5 

Phe Met Asn Cys 

Asp Phe Phe Leu 
40 

Leu Cys Leu Val 
55 

Gly Asn Pro Ser 
70 

He His Ala Arg 
85 

Phe Leu He Ala 

Asp Val Gly Lys 
120 

Asp Gin He Phe 
135 

Cys His Gly Val 
150 

Asp Leu Pro Gly 
165 

Glu Ser Ser Val 

Thr Asp Cys Leu 
200 

Ala Lys He Lys 
215 

Gin Leu Thr Met 
230 

Arg Phe Glu His 
245 

Ser Phe Pro Leu 

Glu Ser Arg Met 
280 

Asn Glu Asn Lys 
295 

Leu Glu Ser Gly 
310 

Trp Phe Lys Cys 
325 

Val Leu Asp Ser 
Glu 



Gly Leu Arg Gly 
10 

He Val Gin Ala 
25 

Ser Asp Arg His 

Cys Glu Met Ser 
60 

Pro His Val Pro 
75 

His Leu Ala Gly 
90 

Ala Leu Asp Val 
105 

Ala Ala Asn Asn 

Thr Gly Gly Leu 
140 

Ser Thr Thr He 
155 

Ser Cys Thr Ser 
170 

Asn Gly Glu Ser 
185 

Arg Arg Phe Thr 

Cys Gly Ser Cys 
220 

Asn Lys Leu Pro 
235 

Ser Ala Lys Gin 
250 

Glu Leu Asp Met 
265 

Asn Gly Gin Leu 

Tyr Ser Leu Phe 
300 

His Tyr Thr Ser 
315 

Asp Asp Ala Val 
330 

Glu Gly Tyr Leu 
345 



Leu He Asn Leu 
15 

Leu Thr His Thr 
30 

Arg Cys Glu Met 
45 

Ser Leu Phe Arg 

Tyr Lys Leu Leu 
80 

Tyr Arg Gin Gin 
95 

Leu His Arg His 
110 

Pro Asn His Cys 
125 

Gin Ser Asp Val 

Asp Pro Cys Trp 
160 

Phe Trp Pro Met 
175 

His He Pro Gly 
190 

Arg Pro Glu His 
205 

Gin Ser Tyr Gin 

Val Val Ala Cys 
240 

Arg Arg Lys He 
255 

Thr Pro Phe Met 
270 

Gin Leu Pro Thr 
285 

Ala Val Val Asn 

Phe He Arg His 
320 

He Thr Lys Ala 
335 

Leu Phe Tyr His 
350 



<210> 56 
<211> 308 
<212> PRT 

<213> Homo sapiens 
<400> 56 

Leu Ser Ser Arg Arg Pro Ala Ala Val Gly Ala Gly Leu Gin Asn Met 

1 5 10 15 

Gly Asn Thr Cys Tyr Glu Asn Ala Ser Leu Gin Cys Leu Thr Tyr Thr 
20 25 30 
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Pro Pro Leu Ala 

35 

Gin Arg Pro Lys 
50 

Trp Ala Leu His 
65 

Ala Ala Gly Phe 

Met Phe Thr Val 
100 

Gin Val Asp His 
115 

Gly Gly Cys Trp 
130 

Ser Asp Thr Phe 
145 

Ala Gin Ser Val 

Leu Asn Gly Glu 
180 

Pro Ala Ser Lys 
195 

Leu Val Leu Lys 
210 

Asn Val Gin Tyr 
225 

Gin Asn Thr Gly 

Ala Gly Trp Ser 
260 

Gin Glu Gly Gin 
275 

Ser lie Thr Ser 
290 

Gin Lys Ser Glu 
305 



Asn Tyr Met Leu 
40 

Cys Cys Met Leu 
55 

Ser Pro Gly His 
70 

His Arg Gly Lys 
85 

Asp Ala Met Lys 

His Ser Lys Asp 
120 

Arg Ser Gin He 
135 

Asp Pro Tyr Leu 
150 

Lys Gin Ala Leu 
165 

Asn Ala Tyr His 

Thr Leu Thr Leu 
200 

Arg Phe Ser Asp 
215 

Pro Glu Cys Leu 
230 

Pro Leu Val Tyr 
245 

Cys His Asp Gly 

Trp Tyr Lys Met 
280 

Val Leu Ser Gin 
295 



Ser Arg Glu His 

Cys Thr Met Gin 
60 

Val He Gin Pro 
75 

Gin Glu Asp Ala 
90 

Lys Ala Cys Leu 
105 

Thr Thr Leu He 

Lys Cys Leu His 
14 0 

Asp He Ala Leu 
155 

Glu Gin Leu Val 
170 

Cys Gly Leu Cys 
185 

His Thr Ser Ala 

Val Thr Gly Asn 
220 

Asp Met Gin Pro 
235 

Val Leu Tyr Ala 
250 

His Tyr Phe Ser 
265 

Asp Asp Ala Lys 

Gin Ala Tyr Val 
300 



Ser Gin Thr Cys 
45 

Ala His He Thr 

Ser Gin Ala Leu 
80 

His Glu Phe Leu 
95 

Pro Gly His Lys 
110 

His Gin He Phe 
125 

Cys His Gly He 

Asp He Gin Ala 
160 

Lys Pro Glu Glu 
175 

Leu Gin Arg Ala 
190 

Lys Val Leu He 
205 

Lys Leu Ala Lys 

Tyr Met Ser Gin 
240 

Val Leu Val His 
255 

Tyr Val Lys Ala 
270 

Val Thr Ala Cys 
285 

Leu Phe Tyr He 



<210> 57 
<211> 337 
<212> PRT 

<213> Homo sapiens 
<400> 57 



Cys 


Glu 


Lys 


Arg 


Glu 


Asn 


Leu 


Leu 


Pro 


Phe 


Val 


Gly 


Leu 


Asn 


Asn Leu 


1 




5 










10 










15 


Gly 


Asn 


Thr 


Cys 


Tyr 


Leu 


Asn 


Ser 


He 


Leu 


Gin 


Val 


Leu 


Tyr 


Phe Cys 






20 










25 










30 




Pro 


Gly 


Phe 


Met 


Tyr 


Cys 


He 


Phe 


Lys 


Thr Arg 


He 


Asp 


Glu 


Met Glu 




35 










40 










45 






He 


Phe 


He 


Tyr 


Arg 


Glu 


Leu 


Asn 


Pro 


Met 


Tyr 


Glu 


Gly Tyr 


Leu Gin 




50 










55 










60 








His 


Asp 


Ala 


Gin 


Glu 


Val 


Leu 


Gin 


Cys 


He 


Leu Gly Asn 


He 


Gin Glu 


65 








70 










75 








80 


Thr 


Cys 


Gin 


Leu 


Leu 


Lys 


Lys 


Glu 


Glu 


Val 


Lys 


Asn 


Val 


Ala 


Glu Leu 








85 










90 










95 


Pro 


Thr 


Lys 


Val 


Glu 


Glu 


He 


Pro 


His 


Pro 


Lys 


Glu 


Glu 


Met 


Asn Gly 






100 










105 










110 




Glu 


Glu 


Gin 


He 


Gly 


Phe 


Glu 


Leu 


Val 


Glu 


Lys 


Leu 


Phe 


Gin Gly Gin 






115 








120 










125 






Leu 


Val 


Leu Arg 


Thr 


Arg 


Cys 


Leu 


Glu 


Cys 


Glu 


Ser 


Leu 


Thr Glu Arg 




130 










135 










140 
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Arg 


Glu 


Asp 


Phe 


145 








Ser 


Lys 


Val 


Glu 


He 


Ser 


Gin 


Phe 








180 


Phe 


Cys 


Glu 


Asn 






195 




Phe 


Asp 


Lys 


Met 




210 






Ala 


Ser 


Gly Asn 


225 








Gly 


Gly 


Gly Leu 


Leu 


Ser 


Leu 


Glu 








260 


Leu 


Phe 


Ala 


Val 






275 




Tyr 


Thr 


Ala 


Ser 




290 






Asp 


Ser 


Glu 


Val 


305 








Ser 


Pro 


Ser 


Thr 



Lys 



Gin Asp He Ser 
150 

Glu Ser Ser Glu 
165 

Ala Ser Val Glu 

Cys His His Tyr 
200 

Pro Glu Val He 
215 

Leu Phe Phe Phe 
230 

Ser Lys He Asn 
245 

Glu Trp Ser Thr 

Val Met His Ser 
280 

Val Lys Glu Tyr 
295 

Lys Val Thr Glu 
310 

Ser Pro Thr Ser 
325 



Val Pro Val" Gin 
155 

Lys Met Lys Thr 
170 

Arg He Val Gly 
185 

Thr Glu Ala Glu 

Thr He His Leu 
220 

Ser Ser Gin Arg 
235 

Thr Pro Leu Leu 
250 

Lys Pro Thr Asn 
265 

Gly He Thr He 

Glu Gly Lys Trp 
300 

Glu Lys Asp Phe 
315 

Thr Pro Tyr Leu 
330 



Glu Asp Glu Leu 
160 

Leu Arg Trp Ala 
175 

Glu Asp Lys Tyr 
190 

Arg Ser Leu Leu 
205 

Lys Cys Phe Ala 

Phe Asp Cys Tyr 
240 

Thr Pro Leu Lys 
255 

Asp Ser Tyr Gly 
270 

Ser Ser Gly His 
285 

Leu Leu Phe Asp 

Leu Asn Ser Leu 
320 

Leu Phe Tyr Lys 
335 



<210> 58 

<211> 308 

<212> PRT 

<213> Homo sapiens 



<400> 58 



Leu 


Ser 


Ser 


Arg 


Arg 


Pro Ala Ala 


Val 


Gly Ala Gly Leu 


Gin 


Asn 


Met 


1 








5 






10 










15 




Gly 


Asn 


Thr 


Cys 


Tyr 


Glu Asn Ala 


Ser 


Leu 


Gin 


Cys 


Leu 


Thr 


Tyr 


Thr 








20 






25 










30 






Leu 


Pro 


Leu 


Ala 


Ash 


Tyr. Met Leu 


Ser Arg 


Glu 


His 


Ser 


Gin 


Thr 


Cys 






35 






40 










45 








Gin 


Arg 


Pro 


Lys 


Cys 


Cys Met Leu 


Cys 


Thr 


Met 


Gin 


Ala 


His 


He 


Thr 




50 








55 








60 










Trp 


Ala 


Leu 


His 


Ser 


Pro Gly His Val 


He 


Gin 


Pro 


Ser 


Gin 


Ala 


Leu 


65 










70 






75 










80 


Ala 


Ala 


Gly 


Phe 


His 


Arg Gly Lys Gin Glu Asp Val 


His 


Glu 


Phe 


Leu 










85 






90 










95 




Met 


Phe 


Thr 


Val 


Asp 


Ala Met Lys 


Lys 


Ala 


Cys 


Leu 


Pro 


Gly 


His 


Lys 








100 






105 










110 






Gin 


Val 


Asp 


His 


His 


Cys Lys Asp 


Thr 


Thr 


Leu 


He 


His 


Gin 


He 


Phe 






115 






12 0 










125 








Gly 


Gly 


Cys 


Trp 


Arg 


Ser Gin He 


Lys 


Cys 


Leu 


His 


Cys 


His 


Gly 


He 




130 








135 








140 










Ser 


Asp 


Thr 


Phe 


Asp 


Pro Tyr Leu Asp 


He 


Ala 


Leu Asp 


He 


Gin 


Ala 


145 










150 






155 










160 


Ala 


Gin 


Ser 


Val 


Lys 


Gin Ala Leu 


Glu 


Gin 


Leu 


Val 


Lys 


Pro 


Glu 


Glu 










165 






170 










175 




Leu 


Asn 


Gly 


Glu 


Asn Ala Tyr His 


Cys 


Gly 


Leu 


Cys 


Leu 


Gin 


Arg Ala 








180 






185 










190 






Pro 


-Ala 


Ser 


Asn 


Thr 


Leu Thr Leu 


His 


Thr 


Ser 


Ala 


Lys 


Val 


Leu 


He 






195 






200 










205 








Leu 


Val 


Leu 


Lys 


Arg 


Phe Ser Asp 


Val 


Ala 


Gly Asn Lys 


Leu 


Ala Lys 




210 








215 








220 
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Asn Val 


Gin 


Tyr 


Pro 


Glu 


Cys 


Leu 


Asp Met Gin 


Pro 


Tyr 


Met 


Ser 


Gin 


225 








230 








235 










240 


Gin Asn 


Thr 


Gly 


Pro 


Leu 


Val 


Tyr 


Val 


Leu Tyr 


Ala 


Val 


Leu 


Val 


His 








245 










250 








255 




Ala Gly 


Trp 


Ser 


Cys 


His Asp Gly His Tyr Phe 


Ser 


Tyr 


Val 


Lys 


Ala 






260 










265 








270 






Gin Glu 


Val 


Gin 


Trp 


Tyr 


Lys 


Met 


Asp 


Asp Ala 


Glu 


Val 


Thr 


Val 


Cys 




275 










280 








285 








Ser lie 


He 


Ser 


Val 


Leu 


Ser Gin Gin Ala Tyr Val 


Leu 


Phe 


Tyr 


He 


290 










295 








300 










Gin Lys 


Ser 


Glu 
























305 




























<210> 59 


























<211> 232 


























<212> PRT 


























<213> Homo sapiens 






















<400> 59 


























Tyr Asp 


Arg 


Lys 


Arg 


Gin 


Asp 


Lys 


Ala 


Pro Val' Gly Leu Lys 


Asn 


Val 


1 






5 










10 








15 




Gly Asn 


Thr 


Cys 


Trp 


Phe 


Ser 


Ala 


Val 


He Gin 


Ser 


Leu 


Phe 


Asn 


Leu 




20 










25 








30 






Leu Glu 


Phe 


Arg 


Arg 


Leu 


Val 


Leu 


Asn 


Tyr Lys 


Pro 


Pro 


Ser. Asn 


Ala 




35 










40 








45 








Gin Asp 


Leu 


Pro 


Arg 


Asn 


Gin 


Lys 


Ala 


Phe Phe 


Phe 


Ser Gin Gin Asp 


50 










55 








60 










Val Ser 


Glu 


Phe 


Thr 


His 


Lys 


Leu 


Leu Asp Trp Leu Glu Asp Ala 


Phe 


65 








70 








75 










80 


Gin Met 


Lys 


Ala 


Glu 


Glu 


Glu 


Thr 


Val 


Gly Lys 


Asp 


Val 


Glu 


Lys 


Leu 






85 










90 








95 




Lys Pro 


Leu 


Cys 


Ser 


Val 


Gly Glu Asp 


Met Lys 


Trp 


Tyr 


Ser 


His 


Cys 






100 










105 








110 






Gly Lys 


His 


Phe 


Cys 


Tyr 


Cys 


Phe 


He 


Ser Phe 


Gin 


His 


Trp 


Phe 


Thr 




115 










120 








125 








Glu Leu 


Pro 


Pro 


Val 


Leu 


Thr 


Phe 


Glu 


Leu Ser 


Arg 


Phe 


Glu 


Phe 


Asn 


130 










135 








14 0 










Gin Ala 


Leu Gly 


Arg 


Pro 


Glu 


Lys 


He 


His Asn 


Lys 


Leu 


Glu 


Phe 


Pro 


145 








150 








155 










160 


Gin Val 


Pro 


Tyr 


Arg 


Leu 


His 


Ala 


Val 


Leu Val 


His 


Glu Gly Gin Ala 








165 










170 








175 




Asn Ala 


Gly His 


Tyr 


Trp 


Ala 


Tyr 


He 


Phe Asp His Arg 


Glu 


Ser Arg 






180 










185 








190 






Trp Met 


Lys 


Tyr 


Asn 


Asp 


He 


Ala 


Val 


Thr Lys 


Ser 


Ser 


Trp 


Glu 


Glu 




195 










200 








205 








Leu Val 


Arg Asp 


Ser 


Phe 


Gly Gly Tyr Arg Asn 


Ala 


Ser 


Ala 


Tyr 


Cys 


210 










215 








220 










Leu Met 


Tyr 


He 


Asn 


Asp 


Lys 


Ala 

















225 230 



<210> 60 
<211> 228 
<212> PRT 

<213> Homo sapiens 
<400> 60 

Asn Asp Trp Arg Arg Val Asp Gly Trp Pro Val Gly Leu Lys Asn Val 

15 .10 15 

Gly Asn Thr Cys Trp Phe Ser Ala Val He Gin Ser Leu Phe Gin Leu 
20 25 30 
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Pro Glu Phe Arg 
35 

Leu Glu Asn Cys 
50 

Phe Thr His Lys 
65 

Val Asn Thr Phe 

Leu Asp Glu Cys 
100 

Leu Pro Ser Asp 
115 

Lys Leu Pro Pro 
13 0 

His Ser Trp Gly 
145 

Pro Tyr Arg Leu 

Gly His Tyr Trp 
180 

Lys Tyr Asn Asp 
195 

Arg Asp Ser Tyr 
210 

Tyr lie Asn Asp 
225 



Arg Leu Val Leu 
40 

Arg Ser His Thr 
55 

Leu Leu Asp Trp 
70 

Gly Gin Tyr Pro 
85 

Leu Glu Gly Ala 

His Ser Val Lys 
12 0 

Val Leu Thr Phe 
135 

Arg Asp Lys Lys 
150 

His Ala Val Leu 
165 

Ala Tyr lie Tyr 

lie Ser Val Thr 
200 

Gly Gly Leu Arg 
215 



Ser Tyr Ser Leu 

Glu Gin Gin Gin 
60 

Leu Glu Asp Ala 
75 

Leu Gin Val Asn 
90 

Met Val Glu Gly 
105 

Tyr Gly Gin Glu 

Glu Leu Ser Arg 
140 

Asp Ser Lys Ala 
155 

Val His Glu Gly 
170 

Asn Gin Pro Arg 
185 

Glu Ser Ser Trp 

Asn Val Ser Ala 
220 



Pro Gin Asn Val 
45 

Asp Val Ser Glu 

Phe Gin Leu Ala 
80 

Gly Tyr Arg Asn 
95 

Asp Val Glu Leu 
110 

Arg Trp Phe Thr 
125 

Phe Glu Phe Asn 

Leu His Thr Val 
160 

Gin Ala Asn Ala 
175 

Gin Ser Trp Leu 
190 

Glu Glu Val Glu 
205 

Tyr Cys Leu Met 



<210> 61 
<211> 256 
<212> PRT 

<213> Homo sapiens 



<400> 61 
























Ser 


Gly 


Ser 


Ser 


Pro 


Ser 


Ser 


Ser 


Trp Pro 


Ser Gly Leu Arg 


Ser 


Ser 


1 






5 








10 








15 




Cys 


Pro 


He 


Phe 


Gin 


Cys 


Leu 


Phe 


Met Leu 


His Leu 


Leu 


Ser 


Arg 


Ser 






20 










25 






30 






Gin 


Ser 


Phe 


Leu 


Trp 


Pro 


Arg 


Val 


Arg Met 


Arg Arg 


Gin 


His 


Gly Ala 






35 










40 






45 








Leu 


Glu 


Phe 


His 


Arg 


Val 


Leu 


Phe 


Gly Ser 


Leu Gin 


Glu 


Glu 


Arg 


Ala 




50 










55 






60 










Gin 


Asp 


Ala 


Asp 


Ser 


Val 


Trp 


Gin 


Gin Gin 


Gin Ala 


His 


Gin 


Gin 


His 


65 








70 








75 








80 


Ser Cys 


Thr 


Leu 


Asp 


Glu 


Cys 


Phe 


Gin Phe 


Tyr Thr 


Lys 


Glu 


Glu 


Gin 










85 








90 








95 




Leu 


Ala 


Gin 


Asp 


Asp 


Ala 


Trp 


Lys 


Cys Pro 


His Cys 


Gin 


Val 


Leu 


Gin 








100 










105 






110 






Gin Gly 


Met 


Val 


Lys 


Leu 


Ser 


Leu 


Trp Thr 


Leu Pro 


Asp 


He 


Leu 


He 






115 










120 






125 








He 


His 


Leu 


Lys 


Arg 


Phe 


Cys 


Gin Val Gly Glu Arg Arg 


Asn 


Lys 


Leu 




130 










135 






140 










Ser 


Thr 


Leu 


Val 


Lys 


Phe 


Pro 


Leu 


Ser Gly 


Leu Asn 


Met 


Ala 


Pro 


His 


145 








150 








155 








160 


Val 


Ala 


Gin 


Arg 


Ser 


Thr 


Ser Pro Glu Ala Gly Leu Gly 


Pro 


Trp 


Pro 








165 








170 








175 




Ser 


Trp 


Lys 


Gin 


Pro 


Asp 


Cys 


Leu 


Pro Thr 


Ser Tyr 


Pro 


Leu 


Asp 


Phe 








180 










185 






190 






Leu 


Tyr 


Asp 


Leu 


Tyr 


Ala 


Val 


Cys 


Asn His His Gly Asn Leu Gin Gly 




195 








200 






205 








Gly His 


Tyr 


Thr Ala Tyr Cys Arg Asn Ser Leu Asp Gly Gin Trp Tyr 




210 










215 






220 
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Ser Tyr Asp Asp 
225 

Thr Arg Gly Ala 



Ser Thr Val Glu 
230 

Tyr lie Leu Phe 
245 



Pro Leu Arg Glu 

235 

Tyr Gin Lys Arg 
250 



Asp Glu Val Asn 
240 

Asn Ser lie Pro 
255 



<210> 62 

<211> 307 

<212> PRT . 

<213> Homo sapiens 



<400> 62 
Leu Pro Pro Ala 
1 

Gly Asn Thr Cys 
20 

Pro Ala Phe lie 
35 

Leu Tyr Ser Cys 
50 

Gin Gin Asp Ala 
65 

Asp Glu Arg Asp 

Leu Thr Ser Asn 
100 

Arg Phe Asp Thr 
115 

Trp Gly His Pro 
130 

Ser Glu Ser Val 
145 

Thr Phe Val Lys 

Pro Cys Trp Leu 
180 

Thr Arg Phe Ser 
195 

Cys lie His Leu 
210 

Lys Arg His Glu 
225 

Ala Val Val Val 

Tyr Arg Arg Ser 
260 

Gin Trp Leu Trp 
275 

Glu Val Leu Ser 

290 
Ser Arg Met 
305 



Phe Phe Leu Gly 
5 

Phe Met Asn Ser 

Ser Leu Ala Leu 
40 

Ser Phe lie Ala 
55 

His Glu Leu Phe 
70 

Arg Gin Pro Arg 
85 

Met Val Cys Lys 

Phe Asp Ser Leu 
120 

Leu Thr Leu Asp 
135 

Arg Asp Val Val 
150 

Gin Leu Lys Leu 
165 

Cys Phe Glu Asp 

Arg Phe Leu Phe 
200 

Gin Arg Leu Ser 

215 

His Val His Ser 
230 

His His Gly Asp 
24 5 

Pro Pro Ser Ala 

Val Ser Asp Asp 
280 

Ser Ser Ala Tyr 
295 



Leu Val Pro Gly 
10 

Leu Leu Gin Gly 
25 

Phe lie Phe Glu 

Gin Glu Gly He 
60 

His Val He Thr 
75 

Ser Gin His Pro 
90 

His Cys Glu His 
105 

Ser Leu Ser He 

His Cys Leu His 
140 

Cys Asp Asn Cys 
155 

Gly Lys Val Ser 
170 

Ser Val Tyr Pro 
185 

His Pro Gin Leu 

Trp Ser Ser His 
220 

Ser Thr Tyr Leu 
235 

Met His Ser Gly 
250 

Arg Asn Pro Leu 
265 

Thr Val Arg Lys 

Leu Leu Phe Tyr 
300 



Leu Val Asn Leu 
15 

Leu Ser Ala Cys 
30 

Ser Leu Leu Pro 
45 

His Leu Tyr Arg 

Ser Ser Leu Glu 
80 

Phe His Gly Arg 
95 

Gin Ser Pro Val 
110 

Pro Ala Ala Thr 

125 

His Phe He Ser 

Thr Lys Arg Thr 
160 

Pro His Tyr Thr 
175 

Ala Pro Glu Thr 
190 

Pro Gin Cys Leu 
205 

Gly Thr Pro Leu 

Phe Arg Leu Met 
240 

His Phe Val Thr 
255 

Ser Thr Ser Asn 
270 

Ala Ser Leu Gin 
285 

Glu Arg Val Leu 



<210> 63 
<211> 316 
<212> PRT 

<213> Homo sapiens 



<400> 63 

Met Leu Ser Ser Pro Asp Phe Tyr Pro Ala Tyr Pro Ser Ala Met Gin 
15 10 15 
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Ser 


Leu 


Ser 


Leu 


Gly Ser Leu Ala Arg Ala 


Leu 


Glu 


Leu 


Met 


Thr 


Gin 








20 










25 










30 






Tyr 


Phe 


Asn 


Asn 


Trp 


Asn 


Trp 


Val 


Tyr Asp 


Asn 


He 


He 


Asp 


Gin 


Asn 




35 










40 










45 








Glu 


Ser 


Lys 


Leu 


Ser 


Lys 


Ser 


Arg Arg Glu Glu 


He 


Glu 


Arg 


Asp Arg 




50 










55 










60 










Glu 


Arg 


Lys 


Glu 


Arg Arg Glu Gly Asp Arg 


Glu 


Lys 


Lys 


Arg 


Gin 


Asn 


65 










70 










75 










80 


Gly Val 


Val 


Glu 


Val 


Pro 


Phe 


Leu 


Leu 


Ser 


Ser 


Lys 


Tyr 


Asp 


Glu 


Pro 










85 










90 










95 




Ser 


Arg 


Gin 


Val 


He 


Leu 


Glu 


Ala 


Leu 


Ala 


Glu 


Phe 


Glu 


Arg 


Ser 


Thr 






100 










105 










110 






Cys 


He 


Arg 


Phe 


Val 


Thr Tyr Gin Asp Gin Arg Asp 


Phe 


He 


Ser 


He 




115 










120 










125 








lie 


Pro 


Met 


Tyr 


Gly Cys 


Phe 


Ser 


Ser 


Val 


Gly Arg 


Ser 


Gly Gly Met 




130 










135 










140 










Gin 


Val 


val 


Ser 


Leu 


Ala 


Pro 


Thr 


Cys 


Leu 


Gin 


Lys 


Gly Arg Gly 


He 


145 










150 










155 










160 


Val 


Leu 


His 


Glu 


Leu 


Met 


His 


Val 


Leu Gly 


Phe 


Trp 


His 


Glu 


His 


Thr 










165 










170 










175 




Arg 


Ala 


Asp 


Arg 
180 


Asp 


Arg 


Tyr 


He 


Arg 
185 


Val 


Asn 


Trp 


Asn 


Glu 
190 


He 


Leu 


Pro Gly 


Phe 


Glu 


He 


Asn 


Phe 


He 


Lys 


Ser 


Arg 


Ser 


Ser 


Asn 


Met 


Leu 






195 










200 










205 








Thr 


Pro 


Tyr 


Asp 


Tyr 


Ser 


Ser 


Val 


Met 


His 


Tyr Gly Arg 


Leu 


Ala 


Phe 




210 










215 










220 










Ser 


Arg 


Arg 


Gly 


Leu 


Pro 


Thr 


He 


Thr 


Pro 


Leu 


Trp 


Ala 


Pro 


Ser 


Val 


225 




230 










235 










240 


His 


He 


Gly 


Gin 


Arg 


Trp 


Asn 


Leu 


Ser 


Ala 


Ser 


Asp 


He 


Thr 


Arg 


Val 








245 










250 










255 




Leu 


Gin 


Leu 


Tyr 
260 


Gly 


Cys 


Ser 


Pro 


Lys 
265 


Leu 


Glu 


Lys 


Val 


Asn 
270 


Thr 


Val 


Asn 


He 


Lys 
275 


He 


He 


Phe 


Leu 


Tyr 
280 


Thr 


Gin 


Leu 


Glu 


Asn 
285 


Thr 


He 


Val 


Ser 


Lys 
290 


His 


Thr 


He 


He 


He 
295 


Ala 


He 


He 


Lys 


Val 
300 


Pro 


Arg 


Asn 


Lys 


Ser 


Asp 


Ser 


Cys 


He 


Lys 


Tyr 


Leu 


Arg 


Lys 


Asn 


Glu 










305 










310 










315 













<210> 64 

<211> 725 

<212> PRT 

<213> Homo sapiens 



<400> 64 



Phe 


Gin 


Leu 


Trp 


He 


Trp Leu Arg 


Pro 


Cys Pro 


Val 


Thr 


Trp 


He 


Pro 


1 








5 






10 








15 




Arg 


Phe 


Pro 


Gly 


Gly Gly Val Phe 


Pro Gly Gly 


Ser 


Leu 


Ser 


Pro 


Leu 






20 






25 








30 






His 


He 


Leu 
35 


Gly 


Thr 


Lys Ala Phe 
40 


Phe 


Lys Val 


Leu 


Phe 
45 


Leu 


Asp 


His 


His 


Phe 


Arg 


Leu 


Tyr 


Met Glu His 


Ser 


Asn Asp 


He 


He 


Ser 


Pro 


His 




50 






55 






60 










Phe 


Lys 


Glu 


He 


Thr 


Gin lie He 


Thr 


Ser Phe 


Gin 


Glu 


He 


He 


Glu 


65 








70 




75 










80 


Glu 


Glu 


Phe 


Gly 


He 


Ser Gin Cys 


Tyr 


Thr Tyr Asn Asn 


Pro 


Ser 


Lys 










85 






90 








95 




Ser 


Asp 


He 


Arg 


He 


His Trp Thr 


Val 


Ser Asp 


Leu 


Ser 


Gin 


Val 


Phe 






100 






105 








110 






He 


Leu 


Ser 
115 


Arg 


Ala 


Lys Ser Leu 
120 


Tyr 


He Gin 


He 


Leu 
125 


Ser 


Gin 


Arg 
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His Ser Arg Lys Lys Arg Leu lie Ser Tyr Pro Arg Tyr lie Glu He 

130 135 140 

Met Val Thr Ala Asp Ala Lys Val Val Ser Ala His Gly Ser Asn Leu 
145 ^ 150 155 160 

Gin Asn Tyr He Leu Thr Leu Met Ser He Val Ala Thr He Tyr Lys 

165 170 175 

Asp Pro Ser He Gly Asn Leu He His He Val Val Val Lys Leu Val 

180 185 190 

Met He His Arg Glu Glu Glu Gly Pro Val lie Asn Phe Asp Gly Ala 

195 200 205 

Thr Thr Leu Lys Asn Phe Cys Ser Trp Gin Gin Thr Gin Asn Asp Leu 

210 215 220 

Asp Asp Val His Pro Ser His His Asp Thr Ala Val Leu He Thr Arg 
225 230 235 240 

Glu Asp He Cys Ser Ser Lys Glu Lys Cys Asn Met Leu Gly Leu Ser 

245 250 255 

Tyr Leu Gly Thr He Cys Asp Pro Leu Gin Ser Cys Phe He Asn Glu 

26 0 2 65 27 0 

Glu Lys Gly Leu He Ser Ala Phe Thr He Ala His Glu Leu Gly His 

275 280 285 

Thr Leu Gly Val Gin His Asp Asp Asn Pro Arg Cys Lys Glu Met Lys 

290 295 300 

Val Thr Lys Tyr His Val Met Ala Pro Ala Leu Ser Phe His Met Ser 
305 310 315 320 

Pro Trp Ser Trp Ser Asn Cys Ser Arg Lys Tyr Val Thr Glu Phe Leu 

325 330 335 

Asp Thr Gly Tyr Gly Glu Cys Leu Leu Asp Lys Pro Asp Glu Glu lie 

340 345 350 

Tyr Asn Leu Pro Ser Glu Leu Pro Gly Ser Arg Tyr Asp Gly Asn Lys 

355 360 365 

Gin Cys Glu Leu Ala Phe Gly Pro Gly Ser Gin Met Cys Pro His lie 

370 375 380 

Glu Asn He Cys Met His Leu Trp Cys Thr Ser Thr Glu Lys Leu His 
385 390 395 400 

Lys Gly Cys Phe Thr Gin His Val Pro Pro Ala Asp Gly Thr Asp Cys 

405 410 415 

Gly Pro Gly Met His Cys Arg His Gly Leu Cys Val Asn Lys Glu Thr 

420 425 430 

Glu Thr Arg Pro Val Asn Gly Glu Trp Gly Pro Trp Glu Pro Tyr Ser 

435 440 445 

Ser Cys Ser Arg Thr Cys Gly Gly Gly He Glu Ser Ala Thr Arg Arg 

450 455 460 

Cys Asn Arg Pro Glu Pro Arg Asn Gly Gly Asn Tyr Cys Val Gly Arg 
465 470 475 480 

Arg Met Lys Phe Arg Ser Cys Asn Thr Asp Ser Cys Pro Lys Gly Thr 

485 490 495 

Gin Asp Phe Arg Glu Lys Gin Cys Ser Asp Phe Asn Gly Lys His Leu 

500 505 510 

Asp He Ser Gly He Pro Ser Asn Val Arg Trp Leu Pro Arg Tyr Ser 

515 520 525 

Gly He Gly Thr Lys Asp Arg Cys Lys Leu Tyr Cys Gin Val Ala Gly 

530 ~ ^ 535 540 

Thr Asn Tyr Phe Tyr Leu Leu Lys Asp Met Val Glu Asp Gly Thr Pro 
545 550 555 560 

Cys Gly Thr Glu Thr His Asp He Cys Val Gin Gly Gin Cys Met Ala 

565 570 575 

Ala Gly Cys Asp His Val Leu Asn Ser Ser Ala Lys He Asp Lys Cys 

580 585 590 

Gly Val Cys Gly Gly Asp Asn Ser Ser Cys Lys Thr He Thr Gly Val 

595 600 605 

Phe Asn Ser Ser His Tyr Gly Tyr Asn Val Val Val Lys He Pro Ala 
610 615 620 
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Gly 


Ala 


Thr 


Asn 


Val 


Asp 


He 


Arg 


Gin 


Tyr 


Ser Tyr 


Ser 


Gly 


Gin 


Pro 


625 










630 










635 








640 


Asp 


Asp 


Ser 


Tyr 


Leu 


Ala 


Leu 


Ser 


Asp 


Ala 


V5_L Li \J-Ly 


Asn 


Phe 




Phe 










645 










650 








655 




Asn 


Gly 


Asn 


Phe 


Leu 


Leu 


Ser 




Ser 


Lys 


J_iy IS ulu 


J- -L fcr 


Asn 


Val 


Gin 








660 










665 








670 






Gly 


Thr 


Arg 


Thr 


Val 


He 


Glu 


Tyr 


Ser Gly 


oer Asn 


Asn 


Ala 


Val 


Glu 






675 










con 
D o u 








DO 3 








Arg 


He 


Asn 


Ser 


Thr 


Asn Arg 


ujLn 


Glu 


Lys 


Glu Leu 


lie 


Leu 


Gin 


Val 




690 










695 


















Leu 


Cys 


Val 


Gly Asn 


Leu 


Tyr 


Asn 


Pro Asp 


val His 


Tyr 


Ser 


Phe 


Asn 


705 










710 


















720 


He 


Pro 


Leu 


Glu 


Glu 






























725 






















<210> 65 


























<211> 700 


























<212> PRT 


























<213> Homo sapiens 






















<400> 65 


























Pro 


Ser 


Leu 


Leu 


Ser 


Cys 


Leu 


Leu 


Ser 


Phe 


Pro Arg 


Pro 


Gly Pro Asp 


1 








5 










10 








15 




He 


Ala 


Trp 


Gin 


Leu 


Ser 


Cys 


Lys 


Gly 


Ser 


Trp He 


Gly 


Thr 


Gin 


Thr 








20 










25 








30 






His 


Ser 


Leu 


Thr 


Val 


Ser 


Ser 


Ala 


Thr 


Asp 


Phe Val 


Leu 


Arg 


Trp 


Gin 






35 










40 








45 








Asn 


Arg 


Met 


Val 


Glu 


Tyr 


Pro 


Gly Val 


Pro 


Gin Met 


Pro 


Tyr Gly 


Gly 




50 










55 








60 










His 


Ser 


Ser 


Pro 


Met 


Thr 


Phe 


Leu 


Leu 


Tyr 


Gly Asp 


He 


Ala 


Asn 


Phe 


65 










70 










75 








80 


Asp 


Phe 


Ty r 


Ser 


Asn 


Leu 


Val 


Val 


Thr 


Ala 


Pro Pro 


Val 


Gly Trp 


Thr 










85 










90 








95 




Ser 


Leu 


Ser 


Ser 


Cys 


Leu 


Asp 


Leu 


Pro 


Asn 


Leu Leu 


Gly 


Leu 


Val 


Gly 








100 










105 








110 






Asp 


Gin 


Leu 


Gly 


Asp 


Thr 


Glu 


Arg 


Lys 


Arg 


Arg His 


Ala 


Lys 


Pro 


Gly 






115 










120 








125 








Ser 


Tyr 


Ser 


He 


Glu 


Val 


Leu 


Leu 


Val 


Val 


Asp Asp 


Ser 


Val 


Val 


Arg 




130 










135 








140 










Phe 


His 


Gly 


Lys 


Glu 


His 


Val 


Gin 


Asn 


Tyr 


Val Leu 


Thr 


Leu 


Met 


Asn 


145 










150 










155 








160 


He 


Val 


Asn 


Glu 


He 


Tyr 


His 


Asp 


Glu 


Ser 


Leu Gly Val 


His 


He 


Asn 










165 








170 








175 




He 


Ala 


Leu 


Val 


Arg 


Leu 


He 


Met 


Val 


Gly 


Tyr Arg 


Gin 


Ser 


Leu 


Ser 








180 










185 








190 






Leu 


He 


Glu 


Arg 


Gly 


Asn 


Pro 


Ser 


Arg 


Ser 


Leu Glu 


Gin 


Val 


Cys 


Arg 






195 










200 








205 








Trp 


Ala 


His 


Ser 


Gin 


Gin Arg Gin Asp 


Pro 


Ser His 


Ala 


Glu 


His 


His 




210 










215 








220 










Asp 


His 


Val 


Val 


Phe 


Leu Thr Arg Gin Asp 


Phe Gly 


Pro 


Ser Gly Met 


225 










230 










235 








240 


Gin 


Gly 


Tyr 


Ala 


Pro 


Val 


Thr Gly Met 


Cys 


His Pro 


Leu 


Arg 


Ser 


Cys 










245 










250 








255 




Ala 


Leu 


Asn 


His 


Glu 


Asp Gly 


Phe 


Ser 


Ser 


Ala Phe 


Val 


He 


Ala 


His 








260 










265 








270 






Glu 


Thr 


Gly 


His 


Val 


Leu Gly Met 


Glu 


His 


Asp Gly Gin 


Gly Asn Gly 






275 










280 








285 








Cys 


Ala 


Asp 


Glu 


Thr 


Ser 


Leu 


Gly Ser Val 


Met Ala 


Pro 


Leu 


Val 


Gin 




290 










295 








300 










Ala 


Ala 


Phe 


His 


Arg 


Phe 


His 


Trp 


Ser Arg 


Cys Ser 


Lys 


Leu 


Glu 


Leu 


305 










310 










315 








320 
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Ser Arg Tyr Leu 

Pro Ala Trp Pro 
340 

Asp Glu Gin Cys 
355 

Ala Phe Arg Thr 
370 

Asp Asn Pro Tyr 
385 

Thr Glu Cys Ala 

Lys Ser Pro Glu 
420 

Thr Lys Phe Gly 
435 

Arg Ser Arg Ser 
450 

Cys Leu Gly Pro 
465 

Pro Gly Thr Tyr 

Ser Tyr Tyr Val 
500 

Pro Asp Asp Asp 
515 

Thr Gly Asp Val 
530 

Cys Ser Tyr Arg 
545 

Pro Val Gly Cys 

Cys Gly Val Cys 
580 

Thr Leu Gly Lys 
595 

Pro Ala Gly Ala 
610 

His Arg lie Val 
625 

Pro Lys Gly Lys 

Glu Trp Glu Asp 
660 

Gly Pro Leu Pro 
675 

Thr Gin Asn lie 
690 



Pro Ser Tyr Asp 
325 

Gin Pro Pro Glu 

Arg Phe Asp Phe 
360 

Phe Glu Pro Cys 
375 

Phe Cys Lys Thr 
390 

Pro Gly Lys Trp 
405 

Gin Thr Tyr Gly 

Ser Cys Ser Arg 
440 

Cys Asn Asn Pro 
455 

Met Phe Glu Tyr 
470 

Glu Asp Phe Arg 
485 

His Gin Asn Ala 

Ala Gin Lys Cys 
520 

Val Phe Met Asn 
535 

Asp Pro Tyr Ser 
550 

Asp Lys Glu Val 
565 

Gly Gly Asp Asn 

Ala Ser Lys Gin 
600 

Arg His lie Gin 
615 

Val Lys Asn Gin 
630 

Glu Ala Thr Ser 
645 

Ala Val Glu Asp 

Glu Ala lie Ala 
680 

Lys Glu Pro Arg 
695 



Cys Leu Leu Asp 
330 

Leu Pro Gly lie 
345 

Gly Ser Gly Tyr 

Lys Gin Leu Trp 
380 

Lys Lys Gly Pro 
395 

Cys Phe Lys Gly 
.410 

Gin Asp Gly Gly 
425 

Ser Cys Gly Gly 

Ser Pro Ala Tyr 
460 

Gin Val Cys Asn 
475 

Ala Gin Gin Cys 
490 

Lys His Ser Trp 
505 

Glu Leu lie Cys 

Gin Val Val His 
540 

Val Cys Ala Arg 
555 

Gly Ser Met Lys 
570 

Ser His Cys Arg 
585 

Ala Ala Leu Lys 

He Glu Ala Leu 
620 

Val Thr Gly Ser 
635 

Arg Thr Phe Thr 
650 

Ala Lys Glu Ser 
665 

lie Leu Val Ser 

His Arg Pro Asp 
700 



Asp Pro Phe Asp 
335 

Asn Tyr Ser Met 
350 

Gin Thr Cys Leu 
365 

Cys Ser His Pro 

Pro Leu Asp Gly 
400 

His Cys He Trp 
415 

Trp Ser Ser Trp 
430 

Gly Val Arg Ser 
445 

Gly Gly Arg Leu 

Ser Glu Glu Cys 
480 

Ala Lys Arg Asn ' 
495 

Val Pro Tyr Glu 
510 

Gin Ser Ala Asp 
525 

Asp Gly Thr Arg 

Gly Glu Cys Val 
560 

Ala Asp Asp Lys 
575 

Thr Val Lys Gly 
590 

Leu Val Gin He 
605 

Glu Lys Ser Pro 

Phe He Leu Asn 
640 

Ala Met Gly Leu 
655 

Leu Lys Thr Ser 
670 

Pro Thr Leu Asp 
685 



<210> 66 

<211> 749 

<212> PRT 

<213> Homo sapiens 

<400> 66 

Leu Leu Leu Leu Leu Leu Leu Leu Leu Phe Leu Arg Gin Ser Trp Gin 

15 10 15 

Gly Pro He He Ser Ala Thr Gin Glu Ala Glu Ala Val Glu Ser Leu 

20 25 30 

Glu Pro Arg Arg Arg Arg Leu His Ser Gly Val Arg Asp Gin Pro Gly 
35 40 45 
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Gin His Gly Glu 
50 

His Gly Leu lie 
65 

Pro Gin Leu Leu 

His Pro His Val 
100 

Phe Ser Ser Ser 
115 

Tyr Ser Leu He 
13 0 

Pro Lys Pro Pro 
145 

Ser Ser Gly Arg 

Asn Val Glu Thr 
180 

Gly Lys Gly Asn 
195 

Ser Gly Leu Phe 
210 

Val Val Ser Leu 
225 

Asn His His Ala 

Ala Leu He Gly 
260 

Thr Gly Phe Asp 
275 

Gly Phe Ala Pro 
290 

He Asn Glu Asp 
305 

Ser Gly His Asn 

Arg Lys Ala Glu 
340 

Gly Val Phe Ser 
355 

Leu Ser Thr Pro 
370 

Gly Gin Tyr Lys 
385 

Asp Thr Gin Cys 

Leu Gly Phe Asp 
420 

Arg Cys Glu Thr 
435 

Leu Ser Met Trp 
450 

Gly Pro Arg Pro 
465 

Glu Cys Ser Arg 

Cys Asn Asn Pro 
500 

Ser Arg He Tyr 
515 

Leu Asp Phe Arg 
530 



Thr Leu Ser Leu 
55 

Arg Thr Arg Lys 
70 

Ala Gin Glu His 
85 

Leu Leu Asp Cys 

Leu Val Ser Val 
12 0 

Ser Leu Pro Asn 
135 

Thr Glu Asp Thr 
150 

Pro Arg Arg Ser 
165 

Leu Val Val Ala 

Val Thr Thr Tyr 
200 

Lys Asp Gly Thr 
215 

He Leu Leu Glu 
230 

Asp Gin Ser Leu 
245 

Lys Asn Gly Lys 

He Cys Ser Trp 
280 

He Ser Gly Met 
295 

Thr Gly Leu Gly 
310 

Phe Gly Met He 
325 

Gly Asn He Met 

Trp Ser Ser Cys 
360 

Gin Ala Gly Cys 
375 

Tyr Pro Asp Lys 
390 

Lys Trp Gin Phe 
405 

He Cys Lys Ser 

Lys Phe Met Pro 
440 

Cys Arg Gin Gly 
455 

He His Gly Gin 
470 

Thr Cys Gly Gly 
485 

Lys Pro Gin Tyr 

Gin Leu Cys Asn 
520 

Ala Gin Gin Cys 
535 



Leu Lys He Gin 
60 

Asn Glu Phe Leu 
75 

Asn Tyr Ser Ser 
90 

Phe Tyr His Cys 
105 

Ser Leu Ser Thr 

Ala Leu Leu Phe 
140 

Tyr Leu Arg Phe 
155 

Ala Gly Lys Ser 
170 

Asp Lys Lys Met 
185 

He Leu Thr Val 

He Gly Ser Asp 
220 

Gin Glu Pro Gly 
235 

Asn Ser Phe Cys 
250 

Arg His Asp His 
265 

Lys Asn Glu Pro 

Cys Ser Lys Tyr 
300 

Leu Ala Phe Thr 
315 

His Asp Gly Glu 
330 

Ser Pro Thr Leu 
345 

Ser Arg Gin Tyr 

Leu Val Asp Glu 
380 

Leu Pro Gly Gin 
395 

Gly Ala Lys Ala 
410 

Leu Trp Cys His 
425 

Ala Ala Glu Gly 

Gin Cys Val Lys 
460 

Trp Ser Ala Trp 
475 

Gly Val Lys Phe 
490 

Gly Gly Leu Phe 
505 

He Asn Pro Cys 

Ala Glu Tyr Asn 
540 



Lys Leu Asp Arg 

He Ser Pro Leu 
80 

Pro Ala Gly His 
95 

His He Lys Asp 
110 

Val Leu Ser Arg 
125 

He Val Asp Ala 

Asp Glu Tyr Gly 
160 

Gin Lys Gly Leu 
175 

Val Glu Lys His 
190 

Met Asn Met Val 
205 

He Asn Val Val 

Gly Leu Leu He 
240 

Gin Trp Gin Ser 
255 

Ala He Leu Leu 
270 

Cys Asp Thr Leu 
285 

Arg Ser Cys Thr 

He Ala His Glu 
320 

Gly Asn Pro Cys 
335 

Thr Gly Asn Asn 
350 

Leu Lys Lys Phe 
365 

Pro Lys Gin Ala 

He Tyr Asp Ala 
400 

Lys Leu Cys Ser 
415 

Arg Val Gly His 
430 

Thr Val Cys Gly 
445 

Phe Gly Glu Leu 

Ser Lys Trp Ser 
480 

Gin Glu Arg His 
495 

Cys Pro Gly Ser 
510 

Asn Glu Asn Ser 
525 

Ser Lys Pro Phe 
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Arg Gly Trp Phe 
545 

Asp Arg Cys Lys 

Ala Met Ser Gly 
580 

Asn Asp Val Cys 
595 

Glu Leu Gly Ser 
610 

Asp Asn Ser Thr 
625 

Cys Leu Thr Leu 

Ala Arg Ser He 
660 

Ala Val Arg Ser 
675 

He Asp Trp Pro 
690 

Gin Arg Ser Phe 
705 

Asn Glu Thr Leu 

Asp Leu Gly Ser 
740 



Tyr Gin Trp Lys 
550 

Leu Tyr Cys Lys 
565 

Lys Val Lys Asp 

He Asp Gly Val 
600 

Lys Ala Val Ser 
615 

Cys Lys Phe Tyr 
630 

Leu Lys Tyr Tyr 
645 

Glu He Gin Glu 

Leu Ser Gin Lys 
680 

Gly Glu Phe Pro 
695 

Asn Arg Pro Glu 
710 

Cys Cys Ser Val 
725 

Leu Gin Ala Pro 



Pro Tyr Thr Lys 
555 

Ala Glu Asn Phe 
570 

Gly Thr Pro Cys 
585 

Cys Glu Leu Val 

Asp Ala Cys Gly 
620 

Lys Gly Leu Phe 
635 

Pro Val Val Leu 
650 

Leu Gin Val Ser 
665 

Tyr Tyr Leu Thr 

Phe Ala Gly Thr 
700 

Arg Leu Tyr Ala 
71*5 

Ala Gin Ala Gly 
730 

Pro Pro Glu Phe 
745 



Val Glu Glu Glu 
560 

Glu Phe Phe Phe 
575 

Ser Pro Asn Lys 
590 

Gly Cys Asp His 
605 

Val Cys Lys Gly 

Lys Gin Phe Ser 
640 

He Pro Ala Gly 
655 

Ser Ser Tyr Leu 
670 

Gly Gly Trp Ser 
685 

Thr Phe Glu Tyr 

Pro Gly Pro Thr 
720 

Gly Gin Leu Arg 
735 

Thr 



<210> 67 
<211> 722 
<212> PRT 

<213> Homo sapiens 
<400> 67 

Leu Leu Leu Trp Arg Cys Pro Leu Ser Pro Ala Phe Pro Leu Leu Pro 

X 5 10 15 

Ser Arg Leu Arg Leu Ser Ala Pro Leu Thr Ser Pro Pro Pro Leu Cys 

20 25 30 

Ser Leu Ser Leu His Ser Pro Leu Leu Gly Pro Leu Thr Pro Pro Pro 

35 40 45 

Ser Pro Pro Pro Leu Leu Ser Pro Leu Pro Ala Pro Arg Ser Pro Thr 

50 55 60 

Ala Pro Ala Ala Pro Ala Ala Ala Ala Ala Thr Pro Pro Pro Ala Pro 
65 70 75 80 

His Gly Ala Ser Pro Leu Leu Thr Leu Leu He Ser Glu Tyr Asp Leu 

85 90 95 

Val Ser Ala Tyr Glu Val Asp His Arg Gly Asp Tyr Val Ser His Glu 

100 105 HO 

He Met His His Gin Arg Arg Arg Arg Ala Val Ala Val Ser Glu Val 

115 120 125 

Glu Ser Leu His Leu Arg Leu Lys Gly Pro Arg His Asp Phe His Met 

130 135 140 

Asp Leu Arg Thr Ser Ser Ser Leu Val Ala Pro Gly Phe He Val Gin 
145 ~ 150 155 160 

Thr Leu Gly Lys Thr Gly Thr Lys Ser Val Gin Thr Leu Pro Pro Glu 

165 170 175 

Asp Phe Cys Phe Tyr Gin Gly Ser Leu Arg Ser His Arg Asn Ser Ser 

180 185 190 

Val Ala Leu Ser Thr Cys Gin Gly Leu Ser Gly Met He Arg Thr Glu 

195 200 205 

Glu Ala Asp Tyr Phe Leu Arg Pro Leu Pro Ser His Leu Ser Trp Lys 
210 215 220 
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Leu Gly Arg Ala Ala Gin Gly Ser Ser Pro Ser His Val Leu Asn Glu 
225 230 235 240 

Glu Leu Asn Val Glu Thr Leu Val Val Val Asp Lys Lys Met Met Gin 

245 250 255 

Asn His Gly His Glu Asn He Thr Thr Tyr Val Leu Thr He Leu Asn 

260 265 270 

Met Val Ser Ala Leu Phe Lys Asp Gly Thr He Gly Gly Asn lie Asn 

275 280 285 

He Ala He Val Gly Leu He Leu Leu Glu Asp Glu Gin Pro Gly Leu 

290 295 300 

Val He Ser His His Ala Asp His Thr Leu Ser Ser Phe Cys Gin Trp 
305 310 315 320 

Gin Ser Gly Leu Met Gly Lys Asp Gly Thr Arg His Asp His Ala He 

325 330 335 

Leu Leu Thr Gly Leu Asp He Cys Ser Trp Lys Asn Glu Pro Cys Asp 

340 345 350 

Thr Leu Gly Phe Ala Pro He Ser Gly Met Cys Ser Lys Tyr Arg Ser 

355 360 365 

Cys Thr He Asn Glu Asp Thr Gly Leu Gly Leu Ala Phe Thr lie Ala 

370 375 380 

His Glu Ser Gly His Asn Phe Gly Met He His Asp Gly Glu Gly Asn 
385 390 395 400 

Met Cys Lys Lys Ser Glu Gly Asn He Met Ser Pro Thr Leu Ala Gly 

405 410 415 

Arg Asn Gly Val Phe Ser Trp Ser Pro Cys Ser Arg Gin Tyr Leu His 

420 425 430 

Lys Phe Leu Ser Thr Ala Gin Ala He Cys Leu Ala Asp Gin Pro Lys 

435 440 445 

Pro Val Lys Glu Tyr Lys Tyr Pro Glu Lys Leu Pro Gly Glu Leu Tyr 

450 455 460 

Asp Ala Asn Thr Gin Cys Lys Trp Gin Phe Gly Glu Lys Ala Lys Leu 
465 470 475 480 

Cys Met Leu Asp Phe Lys Lys Ala Thr Leu Trp Cys His Arg He Gly 

4 85 490 4 95 

Arg Lys Cys Glu Thr Lys Phe Met Pro Ala Ala Glu Gly Thr He Cys 

500 505 510 

Gly His Asp Met Trp Cys Arg Gly Gly Gin Cys Val Lys Tyr Gly Asp 

515 520 525 

Glu Gly Pro Lys Pro Thr His Gly His Trp Ser Asp Trp Ser Ser Trp 

530 1 535 540 

Ser Pro Cys Ser Arg Thr Cys Gly Gly Gly Val Ser His Arg Ser Arg 
545 550 555 560 

Leu Cys Thr Asn Pro Lys Pro Ser His Gly Gly Lys Phe Cys Glu Gly 

565 570 575 

Ser Thr Arg Thr Leu Lys Leu Cys Asn Ser Gin Lys Cys Pro Arg Asp 

580 585 590 

Ser Val Asp Phe Arg Ala Ala Gin Cys Ala Glu His Asn Ser Arg Arg 

595 600 605 

Phe Arg Gly Arg His Tyr Lys Trp Lys Pro Tyr Thr Gin Val Glu Asp 

610 615 620 

Gin Asp Leu Cys Lys Leu Tyr Cys He Ala Glu Gly Phe Asp Phe Phe 
625 630 635 640 

Phe Ser Leu Ser Asn Lys Val Lys Asp Gly Thr Pro Cys Ser Glu Asp 

645 650 655 

Ser Arg Asn Val Cys He Asp Gly He Cys Glu Arg Val Gly Cys Asp 

660 665 670 

Asn Val Leu Gly Ser Asp Ala Val Glu Asp Val Cys Gly Val Cys Asn 

675 680 685 

Gly Asn Asn Ser Ala Cys Thr He His Arg Gly Leu Tyr Thr Lys His 

690 695 700 

His His Thr Asn Arg Glu Tyr Phe Arg Ala Ala Cys Lys Pro Trp Ala 
705 710 715 720 
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Lys Lys 



<210> 68 
<211> 743 
<212> PRT 

<213> Homo sapiens 
<400> 68 

Met Ser Ser Tyr Arg Pro Arg Val lie Ser Arg Met Pro His Leu Leu 

15 10 15 

Arg Leu Leu Leu Ala Val Thr Val Ser Gin Thr Phe lie Val Phe Leu 

20 25 30 

Phe Val Phe Leu Phe Phe Val lie Leu Thr Val Leu Arg lie Thr Asp 

35 40 45 

Gin Val Ser Glu Val Cys Thr Thr Pro Gly Cys Val lie Ala Ala Ala 

50 55 60 

Ala Arg lie Leu Gin Asn Met Asp Pro Thr Thr Glu Pro Cys Asp Asp 
65 ~ 70 75 80 

Phe Tyr Gin Phe Ala Cys Gly Gly Trp Leu Arg Arg His Val He Pro 

85 90 95 

Giu Thr Asn Ser Arg Tyr Ser He Phe Asp Val Leu Arg Asp Glu Leu 

100 ~ 105 HO 

Glu Val He Leu Lys Ala Val Leu Glu Asn Ser Thr Ala Lys Asp Arg 

115 120 125 

Pro Ala Val Glu Lys Ala Arg Thr Leu Tyr Arg Ser Cys Met Asn Gin 

13 0 13 5 14 0 

Ser Val He Glu Lys Arg Gly Ser Gin Pro Leu Leu Asp He Leu Glu 
145 150 155 160 

Val Val Gly Gly Trp Pro Val Ala Met Asp Arg Trp Asn Glu Thr Val 

165 170 175 

Gly Leu Glu Trp Glu Leu Glu Arg Gin Leu Ala Leu Met Asn Ser Gin 

180 185 190 

Phe Asn Arg Arg Val Leu He Asp Leu Phe He Trp Asn Asp Asp Gin 

195 200 205 

Asn Ser Ser Arg His He He Tyr He Asp Gin Pro Thr Leu Gly Met 

210 215 220 

Pro Ser Arg Glu Tyr Tyr Phe Asn Gly Gly Ser Asn Arg Lys Val Arg 
225 ^ 230 235 240 

Glu Ala Tyr Leu Gin Phe Met Val Ser Val Ala Thr Leu Leu Arg Glu 

245 250 255 

Asp Ala Asn Leu Pro Arg Asp Ser Cys Leu Val Gin Glu Asp Met Val 

260 265 270 

Gin Val Leu Glu Leu Glu Thr Gin Leu Ala Lys Ala Thr Val Pro Gin 

275 280 285 

Glu Glu Arg His Asp Val He Ala Leu Tyr His Arg Met Gly Leu Glu 

290 ~ 295 300 

Glu Leu Gin Ser Gin Phe Gly Leu Lys Gly Phe Asn Trp Thr Leu Phe 
305 310 315 320 

He Gin Thr Val Leu Ser Ser Val Lys He Lys Leu Leu Pro Asp Glu 

325 330 335 

Glu Val Val Val Tyr Gly He Pro Tyr Leu Gin Asn Leu Glu Asn He 

340 " 345 350 

He Asp Thr Tyr Ser Ala Arg Thr He Gin Asn Tyr Leu Val Trp Arg 

355 360 365 

Leu Val Leu Asp Arg He Gly Ser Leu Ser Gin Arg Phe Lys Asp Thr 

370 " 375 380 

Arg Val Asn Tyr Arg Lys Ala Leu Phe Gly Thr Met Val Glu Glu Val 
385 390 395 400 

Arg Trp Arg Glu Cys Val Gly Tyr Val Asn Ser Asn Met Glu Asn Ala 
405 410 415 
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Val Gly Ser Leu Tyr Val Arg Glu Ala Phe Pro Gly Asp Ser Lys Ser 

420 425 430 

Met Val Arg Glu Leu He Asp Lys Val Arg Thr Val Phe Val Glu Thr 

435 440 445 

Leu Asp Glu Leu Gly Trp Met Asp Glu Glu Ser Lys Lys Lys Ala Gin 

450 455 460 

Glu Lys Ala Met Ser He Arg Glu Gin He Gly His Pro Asp Tyr He 
465 470 475 480 

Leu Glu Glu Met Asn Arg Arg Leu Asp Glu Glu Tyr Ser Asn Leu Asn 

485 490 495 

Phe Ser Glu Asp Leu Tyr Phe Glu Asn Ser Leu Gin Asn Leu Lys Val 

500 505 510 

Gly Ala Gin Arg Ser Leu Arg Lys Leu Arg Glu Lys Val Asp Pro Asn 

515 520 525 

Leu Trp He He Gly Ala Ala Val Val Asn Ala Phe Tyr Ser Pro Asn 

530 V 535 540 

Arg Asn Gin He Val Phe Pro Ala Gly He Leu Gin Pro Pro Phe Phe 
54 i 550 555 560 

Ser Lys Glu Gin Pro Gin Ala Leu Asn Phe Gly Gly He Gly Met Val 

565 570- 575 

He Gly His Glu He Thr His Gly. Phe Asp Asp Asn Gly Arg Asn Phe 

580 585 590 

Asp Lys Asn Gly Asn Met Met Asp Trp Trp Ser Asn Phe Ser Thr Gin 

595 600 605 

His Phe Arg Glu Gin Ser Glu Cys Met He Tyr Gin Tyr Gly Asn Tyr 

610 615 620 

Ser Trp Asp Leu Ala Asp Glu Gin Asn Val Asn Gly Phe Asn Thr Leu 
625 630 635 640 

Gly Glu Asn He Ala Asp Asn Gly Gly Val Arg Gin Ala Tyr Lys Ala 

645 650 655 

Tyr Leu Lys Trp Met Ala Glu Gly Gly Lys Asp Gin Gin Leu Pro Gly 

660 665 670 

Leu Asp Leu Thr His Glu Gin Leu Phe Phe He Asn Tyr Ala Gin Val 

675 680 685 

Trp Cys Gly Ser Tyr Arg Pro Glu Phe Ala He Gin Ser He Lys Thr 

690 695 700 

Asp Val His Ser Pro Leu Lys Val Leu Gly Ser Leu Gin Asn Leu Ala 
705 710 715 t 720 

Ala Phe Ala Asp Thr Phe His Cys Ala Arg Gly Thr Pro Met His Pro 

725 730 735 

Lys Glu Arg Cys Arg Val Trp 
740 



<210> 69 
<211> 371 
<212> PRT 

<213> Homo sapiens 

<400> 69 , 

Met Thr Ala Leu Asp Arg Ala Cys Leu Tyr Trp Leu Phe Leu Phe Lys. 

1 5 10 15 

Leu Leu Val He Asp He Lys Asn Asn Gly His Phe Tyr Val Thr Leu 

20 25 30 

Ala Asn Ser Lys His Leu Ser Leu Asp Phe He Val His He Thr He 

35 40 45 

Ser He Leu Val Lys Ala He Gin Arg Val Ser Arg Lys Phe Gin Thr 

50 55 60 

Phe Pro His Phe Pro Val Phe Tyr Trp Ala Leu Gin Thr He Tyr Glu 
65 70 75 8 0 

Trp Met Arg Glu He Ser Glu Lys Tyr Lys Glu Val Val Thr Gin His 
85 90 95 
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Phe Leu Gly Val Thr Tyr Glu Thr His Pro Met Tyr Tyr Leu Lys lie 

100 105 HO 

Ser Gin Pro Ser Gly Asn Pro Lys Lys He He Trp Met Asp Cys Gly 

115 120 125 

He His Ala Arg Glu Trp He Ala Pro Ala Phe Cys Gin Trp Phe Val 

130 " 135 140 

Lys Glu He Leu Gin Asn His Lys Asp Asn Ser Ser He Arg Lys Leu 
145 150 . 155 160 

Leu Arg Asn Leu Asp Phe Tyr Val Leu Pro Val Leu Asn He Asp Gly 

165 170 175 

Tyr He Tyr Thr Trp Thr Thr Asp Arg Leu Trp Arg Lys Ser Arg Ser 

180 185 190 

Pro His Asn Asn Gly Thr Cys Phe Gly Thr Asp Leu Asn Arg Asn Phe 

195 200 205 

Asn Ala Ser Trp Cys Ser He Gly Ala Ser Arg Asn Cys Gin Asp Gin 

210 215 220 

Thr Phe Cys Gly Thr Gly Pro Val Ser Glu Pro Glu Thr Lys Ala Val 
225 ^ 230 235 240 

Ala Ser Phe He Glu Ser Lys Lys Asp Asp He Leu Cys Phe Leu Thr 

245 250 255 

Met His Ser Tyr Gly Gin Leu He Leu Thr Pro Tyr Gly Tyr Thr Lys 

260 265 270 

Asn Lys Ser Ser Asn His Pro Glu Met He Gin Val Gly Gin Lys Ala 

275 280 285 

Ala Asn Ala Leu Lys Ala Lys Tyr Gly Thr Asn Tyr Arg Val Gly Ser 

290 295 300 

Ser Ala Asp He Leu Tyr Ala Ser Ser Gly Ser Ser Arg Asp Trp Ala 
305 310 315 320 

Arg Asp He Gly He Pro Phe Ser Tyr Thr Phe Glu Leu Arg Asp Ser 

325 330 335 

Glv Thr Tyr Gly Phe Val Leu Pro Glu Ala Gin He Gin Pro Thr Cys 

340 345 350 

Glu Glu Thr Met Glu Ala Val Leu Ser Val Leu Asp Asp Leu Gin Lys 

355 360 365 

Asn Pro Tyr 
370 



<210> 70 
















<211> 318 
















<212> PRT 
















<213> Homo sapiens 












<400> 70 
















Pro Asn Thr 


Gin 


Asn 


His 


Met 


Pro 


Leu 


Cys 


1 




5 










10 


Ser Tyr His 


Ser 


Gly 


Phe 


Cys 


Gin 


Asp 


Cys 


20 










25 




He Ser His 


Ser 


He 


Val 


Leu 


Pro 


Ala 


Ala 


35 










40 






Val Pro Lys 


His 


He 


Lys 


Lys 


Pro 


Asp Tyr 


50 








55 








Pro Asp Trp 


Gly 


Asp 


Ser 


He 


Glu 


Val 


Lys 


65 






70 










Gly Leu His 


Gin 


Ala 


Cys 


Gin 


Leu 


Ala 


Arg 




85 










90 


Gly Lys Ser 


Leu 


Lys 


Val 


Asp 


Met 


Thr 


Thr 


100 










105 




Val His Arg 


Glu 


He 


He 


Ser 


His 


Asn 


Ala 


115 










120 






Tyr Gly Gly 


Phe 


Pro 


Lys 


Ser 


Val 


Cys 


Thr 


130 








135 









15 

Phe Arg Arg Asn Glu Asp 
30 

Val Ser Ser Ala His Pre 
45 

Val Thr Thr Gly He Val 
60 

Asn Glu Asp Gin He Glr 
75 80 
His Val Leu Leu Leu Ale 
95 

Glu Glu He Asp Ala Lei 
110 

Tyr Pro Ser Pro Leu Gl} 
125 

Ser Val Asn Asn Val Lei 
140 
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Cys His Gly lie 
14 5 

Asn lie Asp Val 

Glu Thr Phe Leu 
180 

Glu Val Ala Arg 
195 

Gly Ala Pro Phe 
210 

Gin Asn Gly Phe 
225 

Ser Tyr Phe His 

Asp Leu Pro Met 
260 

Thr Glu Gly Ser 
275 

Val Ser Leu Asp 
290 

lie Thr Ser Arg 
305 



Pro Asp Ser Arg 
150 

Thr Val Tyr Tyr 
165 

Val Gly Asn Val 

Arg Cys Arg Asp 
200 

Ser Val lie Gly 
215 

Gin Val Cys Pro 
230 

Gly His Pro Glu 
245 

Glu Glu Gly Met 

Pro Glu Phe Lys 
280 

Asn Gin Arg Ser 
295 

Gly Ala Gin He 
310 



Pro Leu Gin Asp 

. 155 
Asn Gly Tyr His 
170 

Asp Glu Cys Gly 
185 

Glu Ala He Ala 

Asn Thr lie Ser 
220 

His Phe Val Gly 
235 

He Trp His His 
250 

Ala Phe Thr He 
265 

Val Leu Glu Asp 

Ala Gin Phe Glu 
300 

Leu Thr Lys Leu 
315 



Gly Asp He lie 
160 

Gly Asp Thr Ser 
175 

Lys Lys Leu Val 
190 

Ala Cys Arg Ala 
205 

His He Thr His 

His Gly He Gly 
240 

Ala Asn Asp Ser 
255 

Glu Pro He He 
270 

Ala Trp Thr Val 
285 

His Thr Val Leu 
Pro His 



<210> 71 
<211> 237 
<212> PRT 

<213> Homo sapiens 



<400> 71 



Arg 


He 


Val 


Gly 


Gly 


Met 


Glu Ala Ser 


Pro 


Gly Glu Phe 


Pro 


Trp 


Gin 


1 








5 






10 










15 




Ala 


Ser 


Leu 


Arg 


Glu 


Asn 


Lys Glu His 


Phe 


Cys 


Gly Ala Ala 


He 


He 








20 






25 










30 






Asn 


Ala 


Arg 


Trp 


Leu 


Val 


Ser Ala Ala 


His 


Cys 


Phe 


Asn 


Glu 


Phe 


Gin 






35 








40 








45 








Asp 


Pro 


Thr 


Lys 


Trp 


Val 


Ala Tyr Val 


Gly Ala 


Thr 


Tyr 


Leu 


Ser 


Gly 


50 










55 






60 










Ser 


Glu 


Ala 


Ser 


Thr 


Val 


Arg Ala Gin 


Val 


Val 


Gin 


He 


Val 


Lys 


His 


65 










70 




75 










80 


Pro 


Leu 


Tyr 


Asn 


Ala 


Asp 


Thr Ala Asp 


Phe 


Asp 


Val 


Ala 


Val 


Leu 


Glu 








85 






90 










95 




Leu 


Thr 


Ser 


Pro 


Leu 


Pro 


Phe Gly Arg 


His 


He 


Gin 


Pro 


Val 


Cys 


Leu 








100 






105 










110 






Pro 


Ala 


Ala 


Thr 


His 


He 


Phe Pro Pro 


Ser 


Lys 


Lys 


Cys 


Leu 


He 


Ser 






115 








120 








125 








Gly 


Trp 


Gly Tyr 


Leu 


Lys 


Glu Asp Phe 


Leu 


Val 


Lys 


Pro 


Glu 


Val 


Leu 




130 










13 5 






140 










Gin 


Lys 


Ala 


Thr 


Val 


Glu 


Leu Leu Asp 


Gin 


Ala 


Leu 


Cys 


Ala 


Ser 


Leu 


145 








150 






155 










160 


Tyr 


Gly His 


Ser 


Leu 


Thr Asp Arg Met 


Val 


Cys Ala Gly Tyr 


Leu 


Asp 








165 






170 










175 




Gly 


Lys 


Val 


Asp 


Ser 


Cys 


Gin Gly Asp 


Ser Gly Gly 


Pro 


Leu 


Val 


Cys 






180 






185 










190 






Glu 


Glu 


Pro 


Ser 


Gly Arg 


Phe Phe Leu 


Ala 


Gly 


He 


Val 


Ser Trp Gly 






195 








200 








205 








He 


Gly 


Cys 


Ala 


Glu 


Ala 


Arg Arg Pro Gly 


Val 


Tyr Ala Arg 


Val 


Thr 




210 










215 






220 










Arg 


Leu 


Arg Asp 


Trp 


He 


Leu Glu Ala 


Thr 


Thr Lys Ala 








225 










230 






235 
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<210> 72 
<211> 238 
<212> PRT 

<213> Homo sapiens 
<400> 72 

Arg lie Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin 

15 10 15 

Ala Ser Leu Gin Val Arg Gly Arg His lie Cys Gly Gly Ala Leu He 

20 25 30 

Ala Asp Arg Trp Val He Thr Ala Ala His Cys Phe Gin Glu Asp Ser 

35 40 45 

Met Ala Ser Thr Val Leu Trp Thr Val Phe Leu Gly Lys Val Trp Gin 

50 55 60 

Asn Ser Arg Trp Pro Gly Glu Val Ser Phe Lys Val Ser Arg Leu Leu 
65 70 75 80 

Leu His Pro Tyr His Glu Glu Asp Ser His Asp Tyr Asp Val Ala Leu 

85 90 95 

Leu Gin Leu Asp His Pro Val Val Arg Ser Ala Ala Val Arg Pro Val 

100 105 HO 

Cys Leu Pro Ala Arg Ser His Phe Phe Glu Pro Gly Leu His Cys Trp 

115 120 125 

He Thr Gly Trp Gly Ala Leu Arg Glu Gly Gly Pro He Ser Asn Ala 

13 0 13 5 14 0 

Leu Gin Lys Val Asp Val Gin Leu He Pro Gin Asp Leu Cys Ser Glu 
145 ~ 150 155 160 

Val Tyr Arg Tyr Gin Val Thr Pro Arg Met Leu Cys Ala Gly Tyr Arg 

165 170 175 

Lys Gly Lys Lys Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val 

180 185 190 

Cys Lys Ala Leu Ser Gly Arg Trp Phe Leu Ala Gly Leu Val Ser Trp 

195 200 205 

Gly Leu Gly Cys Gly Arg Pro Asn Tyr Phe Gly Val Tyr Thr Arg He 

210 215 220 

Thr Gly Val He Ser Trp He Gin Gin Thr Met Ala Gin Ser 
225 230 235 



<210> 73 
<211> 235 
<212> PRT 

<213> Homo sapiens 



<400> 73 
















Arg 


He 


Val 


Gly 


Gly 


Ser Ala Ala Gly Arg Gly Glu 


Trp 


Pro 


Trp 


Gin 


1 






5 


10 






15 




Val 


Ser 


Leu 


Trp 


Leu 


Arg Arg Arg Glu His Arg Cys 


Gly Ala 


Val 


Leu 








20 




25 




30 






Val 


Ala 


Glu 


Arg 


Trp 


Leu Leu Ser Ala Ala His Cys 


Phe 


Asp 


Val 


Tyr 






35 






40 


45 








Gly 


Asp 


Pro 


Lys 


Gin 


Trp Ala Ala Phe Leu Gly Thr 


Pro 


Phe 


Leu 


Ser 




50 








55 60 










Gly 


Ala 


Glu 


Gly 


Gin 


Leu Glu Arg Val Ala Arg He 


Tyr 


Lys 


His 


Pro 


65 








70 75 








80 


Phe 


Tyr 


Asn 


Leu 


Tyr 


Thr Leu Asp Tyr Asp Val Ala 


Leu 


Leu 


Glu 


Leu 








85 


' 90 






95 




Ala 


Gly 


Pro 


Val 


Arg Arg Ser Arg Leu Val Arg Pro 


He 


Cys 


Leu 


Pro 








100 




105 




110 






Glu 


Pro 


Ala 


Pro 


Arg 


Pro Pro Asp Gly Thr Arg Cys 


Val 


He Thr Gly 






115 




12 0 


125 








Trp 


Gly 


Ser 


Val 


Arg Glu Gly Gly Ser Met Ala Arg Gin Leu Gin Lys 


130 








135 140 
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Ala Ala Val Arg Leu Leu Ser Glu 
145 150 
Val Gin lie Ser Ser Arg Met Leu 
165 

Val Asp Ser Cys Ser Gly Asp Ala 
180 

Pro Ser Gly Arg Trp Val Leu Thr 
195 200 
Cys Gly Arg Pro His Phe Pro Gly 

210 215 
Arg Gly Trp He Gly Gin His He 
225 230 



Gin Thr Cys Arg Arg Phe Tyr Pro 
155 160 
Cys Ala Gly Phe Pro Gin Gly Gly 

170 175 
Gly Gly Pro Leu Ala Cys Arg Glu 
185 190 
Gly Val Thr Ser Trp Gly Tyr Gly 
205 

Val Tyr Thr Arg Val Ala Ala Val 

.220 

Gin Asp Asn 
235 



<210> 74 
<211> 238 
<212> PRT 

<213> Homo sapiens 
<400> 74 

Arg He He Gly Gly Thr Asp Thr Leu Glu Gly Gly Trp Pro Trp Gin 

1 5 10 15 

Val Ser Leu His Phe Val Gly Ser Ala Tyr Cys Gly Ala Ser Val He 

20 25 30 

Ser Arg Glu Trp Leu Leu Ser Ala Ala His Cys Phe His Gly Asn Arg 

35 40 45 

Leu Ser Asp Pro Thr Pro Trp Thr Ala His Leu Gly Met Tyr Val Gin 

50 55 60 

Gly Asn Ala Lys Phe Val Ser Pro Val Arg Arg He Val Val His Glu 
65 " 70 75 80 

Tyr Tyr Asn Ser Gin Thr Phe Asp Tyr Asp He Ala Leu Leu Gin Leu 

85 90 95 

Ser He Ala Trp Pro Glu Thr Leu Lys Gin Leu He Gin Pro He Cys 

100 105 HO 

He Pro Pro Thr Gly Gin Arg Val Arg Ser Gly Glu Lys Cys Trp Val 

115 120 125 

Thr Gly Trp Gly Arg Arg His Glu Ala Asp Asn Lys Gly Ser Leu Val 

130 135 140 

Leu Gin Gin Ala Glu Val Glu Leu He Asp Gin Thr Leu Cys Val Ser 
145 150 155 160 

Thr Tyr Gly He He Thr Ser Arg Met Leu Cys Ala Gly He Met Ser 

165 170 175 

Gly Lys Arg Asp Ala Cys Lys Gly Asp Ser Gly Gly Pro Leu Ser Cys 

180 185 190 

Arg Arg Lys Ser Asp Gly Lys Trp He Leu Thr Gly He Val Ser Trp 

195 200- 205 

Gly His Gly Ser Gly Arg Pro Asn Phe Pro Gly Val Tyr Thr Arg Val 

210 215 220 

Ser Asn Phe Val Pro Trp He His Lys Tyr Val Pro Ser. Leu 
225 230 235 



<210> 75 
<211> 235 
<212> PRT 

<213> Homo sapiens 
<400> 75 

Arg He Val Gin Gly Arg Glu Thr Ala Met Glu Gly Glu Trp Pro Trp 

1 5 10 15 

Gin Ala Ser Leu Gin Leu He Gly Ser Gly His Gin Cys Gly Ala Ser 
20 25 30 
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Leu He Ser Asn Thr Trp Leu Leu Thr Ala Ala His Cys Phe Trp Lys 

35 40 45 

Asn Lys Asp Pro Thr Gin Trp He Ala Thr Phe Gly Ala Thr He Thr 

50 55 6 0 

Pro Pro Ala Val Lys Arg Asn Val Arg Lys He He Leu His Glu Asn 
65 70 75 80 

Tyr His Arg Glu Thr Asn Glu Asn Asp He Ala Leu Val Gin Leu Ser 

85 90 95 

Thr Gly Val Glu Phe Ser Asn He Val Gin Arg Val Cys Leu Pro Asp 

100 105 HO 

Ser Ser He Lys Leu Pro Pro Lys Thr Ser Val Phe Val Thr Gly Phe 

115 120 125 

Gly Ser He Val Asp Asp Gly Pro He Gin Asn Thr Leu Arg Gin Ala 

130 135 140 

Arg Val Glu Thr He Ser Thr Asp Val Cys Asn Arg Lys Asp Val Tyr 
145 150 155 160 

Asp Gly Leu He Thr Pro Gly Met Leu Cys Ala Gly Phe Met Glu Gly 

165 170 175 

Lys He Asp Ala Cys Lys Gly Asp Ser Gly Gly Pro Leu Val Tyr Asp 

180 "* 185 190 

Asn His Asp He Trp Tyr He Val Gly He Val Ser Trp Gly. Gin Ser 

195 200 205 

Cys Ala Leu Pro Lys Lys Pro Gly Val Tyr Thr Arg Val Thr Lys Tyr 

210 215 220 

Arg Asp Trp He Ala Ser Lys Thr Gly Met Asn 
225 ~ 230 235 



<210> 76 

<211> 230 

<212> PRT 

<213> Homo sapiens 

<400> 76 

Arg He Val Gly Gly Ser Ala Ala Pro Pro Gly Ala Trp Pro Trp Leu 

1 5 10 15 

Val Arg Leu Gin Leu Gly Gly Gin Pro Leu Cys Gly Gly Val Leu Val 

20 25 30 

Ala Ala Ser Trp Val Leu Thr Ala Ala His Cys Phe Leu Leu Trp Thr 

35 ^ 40 45 

Val Thr Leu Ala Glu Gly Ser Arg Gly Glu Gin Ala Glu Glu Val Pro 

50 55 60 

Val Asn Arg He Leu Pro His Pro Lys Phe Asp Pro Arg Thr Phe His 
65 ~ 70 75 80 

Asn Asp Leu Ala Leu Val Gin Leu Trp Thr Pro Val Ser Pro Gly Gly 

85 90 95 

Ser Ala Arg Pro Val Cys Leu Pro Gin Glu Pro Gin Glu Pro Pro Ala 

100 105 HO 

Gly Thr Ala Cys Ala He Ala Gly Trp Gly Ala Leu Phe Glu Asp Gly 

115 120 125 

Pro Glu Ala Glu Ala Val Arg Glu Ala Arg Val Pro Leu Leu Ser Thr 

130 135 140 

Asp Thr Cys Arg Arg Ala Leu Gly Pro Gly Leu Arg Pro Ser Thr Met 
145 " 150 155 160 

Leu Cys Ala Gly Tyr Leu Ala Gly Gly Val Asp Ser Cys Gin Gly Asp 

165 170 175 

Ser Gly Gly Pro Leu Thr Cys Ser Glu Pro Gly Pro Arg Pro Arg Glu 

180 185 190 

Val Leu Phe Gly Val Thr Ser Trp Gly Asp Gly Cys Gly Glu Pro Gly 

195 200 205 

Lys Pro Gly Val Tyr Thr Arg Val Ala Val Phe Lys Asp Trp Leu Gin 
210 215 220 



WO 03/040393 



PCT/IB02/04615 



45/58 



Glu Gin Met Ser Gly Glu 
225 230 



<210> 77 
<211> 233 
<212> PRT 

<213> Homo sapiens 



Arq°ile 7 Ala Ser Gly Val lie Ala Pro Lys Ala Ala Trp *Pro Trp Gin 

1 5 10 15 

Ala Ser Leu Gin Tyr Asp Asn lie His Gin Cys Gly Ala Thr Leu He 

20 25 30 

Ser Asn Thr Trp Leu Val Thr Ala Ala His Cys Phe Gin Lys Tyr Lys 

35 40 45 

Asn Pro His Gin Trp Thr Val Ser Phe Gly Thr Lys -He Asn Pro Pro 

50 55 60 

Leu Met Lys Arg Asn Val Arg Arg Phe He He His Glu Lys Tyr Arg 
65 " 70 75 80 

Ser Ala Ala Arg Glu Tyr Asp He Ala Val Val Gin Val Ser Ser Arg 

85 90 95 

Val Thr Phe Ser Asp Asp He Arg Gin He Cys Leu Pro Glu Ala Ser 

100 105 HO 

Ala Ser Phe Gin Pro Asn Leu Thr Val His He Thr Gly Phe Gly Ala 

115 120 125 

Leu Tyr Tyr Gly Gly Glu Ser Gin Asn Asp Leu Arg Glu Ala Arg Val 

130 135 140 

Lys He He Ser Asp Asp Val Cys Lys Gin Pro Gin Val Tyr Gly Asn 
145 150 155 

Asp He Lys Pro Gly Met Phe Cys Ala Gly Tyr Met Glu Gly He Tyr 

165 170 175 

Asp Ala Cys Arg Gly Asp Ser Gly Gly Pro Leu Val Thr Arg Asp Leu 

180 185 190 

Lys Asp Thr Trp Tyr Leu He Gly He Val Ser Trp Gly Asp Asn Cys 

195 200 205 

Gly Gin Lys Asp Lys Pro Gly Val Tyr Thr Gin Val Thr Tyr Tyr Arg 

210 ~ 215 220 

Asn Trp He Ala Ser Lys Thr Gly He 
225 230 

<210> 78 
<211> 247 
<212> PRT 

<213> Homo sapiens 

Arq°ile 8 He Gly Gly Thr Glu Ala Gin Ala Gly Ala Trp Pro Trp Val 

1 5 10 15 

Val Ser Leu Gin He Lys Tyr Gly Arg Val Leu Val His Val Cys Gly 

20 25 30 

Gly Thr Leu Val Arg Glu Arg Trp Val Leu Thr Ala Ala His Cys Thr 

35 40 45 

Lys Asp Ala Ser Asp Pro Leu Met Trp Thr Ala Val He Gly Thr Asn 
50 55 60 



Asn He His Gly Arg Tyr Pro His Thr Lys Lys He Lys He Lys Ala 
65 70 75 

e He He His Pro Asn Phe He Leu Glu Se3 

85 90 95 

Ala Leu Phe His Leu Lys Lys Ala Val Arg Tyr Asn Asp Tyr He Gin 
100 105 11° 



65 70 75 

lie He He His Pro Asn Phe He Leu Glu Ser Tyr Val Asn Asp He 

on 9 5 
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Pro He Cys Leu Pro Phe Asp Val Phe Gin He Leu Asp Gly Asn Thr 

115 120 125 

Lys Cys Phe He Ser Gly Trp Gly Arg Thr Lys Glu Glu Gly Asn Tyr 

130 . 135 140 

Gly Asn Ala Thr Asn He Leu Gin Asp Ala Glu Val His Tyr He Ser 
145 150 155 160 

Arq Glu Met Cys Asn Ser Glu Arg Ser Tyr Gly Gly He He Pro Asn 

165 170 175 

Thr Ser Phe Cys Ala Gly Asp Glu Asp Gly Ala Phe Asp Thr Cys Arg 

180 185 190 

Gly Asp Ser Gly Gly Pro Leu Met Cys Tyr Leu Pro Glu Tyr Lys Arg 

195 200 205 

Phe Phe Val Met Gly He Thr Ser Tyr Gly His Gly Cys Gly Arg Arg 

210 215 220 

Gly Phe Pro Gly Val Tyr He Gly Pro Ser Phe Tyr Gin Lys Trp Leu 
225 * 230 235 240 

Thr Glu His Phe Phe His Ala 
245 



<210> 79 
<211> 253 
<212> PRT 

<213> Homo sapiens 
<400> 79 

Arg He Ser Ser Trp Arg Asn Ser Thr Val Thr Gly His Pro Trp Gin 

1 5 10 15 

Val Ser Leu Lys Ser Asp Glu His His Phe Cys Gly Gly Ser Leu He 

20 25 30 

Gin Glu Asp Arg Val Val Thr Ala Ala His Cys Leu Asp Ser Leu Ser 

35 40 45 

Glu Lys Gin Leu Lys Asn He Thr Val Thr Ser Gly Glu Tyr Ser Leu 

50 55 60 

Phe Gin Lys Asp Lys Gin Glu Gin Asn He Pro Val Ser Lys He He 
65 70 75 80 

Thr His Pro Glu Tyr Asn Ser Arg Glu Tyr Met Ser Pro Asp He Ala 

85 90 95 

Leu Leu Tyr Leu Lys His Lys Val Lys Phe Gly Asn Ala Val Gin Pro 

100 105 HO 

He Cys Leu Pro Asp Ser Asp Asp Lys Val Glu Pro Gly He Leu Cys 

115 120 125 

Leu Ser Ser Gly Trp Gly Lys He Ser Lys Thr Ser Glu Tyr Ser Asn 

130 135 140 

Val Leu Gin Glu Met Glu Leu Pro He Met Asp Asp Arg Ala Cys Asn 
145 150 155 160 

Thr Val Leu Lys Ser Met Asn Leu Pro Pro Leu Gly Arg Thr Met Leu 

165 170 175 

Cvs Ala Gly Phe Pro Asp Trp Gly Met Asp Ala Cys Gin Gly Asp Ser 

180 185 190 

Gly Gly Pro Leu Val Cys Arg Arg Gly Gly Gly He Trp He Leu Ala 

195 200 205 

Gly He Thr Ser Trp Val Ala Gly Cys Ala Gly Gly Ser Val Pro Val 

210 215 220 

Arg Asn Asn His Val Lys Ala Ser Leu Gly He Phe Ser Lys Val Ser 
225 230 235 240 

Glu Leu Met Asp Phe He Thr Gin Asn Leu Phe Thr Gly 
245 250 



<210> 80 
<211> 241 
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<212> PRT 

<213> Homo sapiens 

<400> 80 * 

Glu lie Trp Ser Gly Glu Gin Gly Gin Asn Asp Met Val Trp Leu Ser 

1 5 io 15 

Ser Leu Lys Met Ser Gly Gin His Tyr Cys Gly Ala Ser Leu He Ser 

20 25 30 

Glu Arg His Leu Val Thr Ala Ala His Cys Phe Lys Val Thr Lys Asn 

35 40 45 

(Pro Lys Asn Tyr Thr Val Ser Phe Gly Thr Lys Val Thr Leu Pro Tyr 

50 55 60 

Met Gin His Asp Val Gin Gin He He He His Glu Asp Tyr He Gin 
65 70 75 80 

Asp Glu His His Asp Asp He Ala Leu He Leu Leu Thr Lys Lys Val 

85 90 95 

Leu Phe Lys Asn Asp Val His Arg Val Cys Leu Pro Glu Ala Thr Gin 

100 105 HO 

He Phe Pro Pro Gly Glu Gly Val Val Val Thr Gly Trp Gly Arg Leu 

115 120 125 

Ser Phe Asn Gly Lys He Ser Glu Asn Leu Thr Tyr His Lys Ala Ser 

130 135 140 

Val Lys He Thr Asp Thr Asn Thr Cys Asn Ala Lys Glu Ala Tyr Arg 
145 150 155 160 

Ser Met Val Gin Asp Arg Val Leu Cys Ala Gly Tyr Met Glu Gly Asn 

165 " 170 175 

He Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val His Pro Asn 

180 185 190 

Ser Leu Asn He Trp Tyr He Trp Tyr Leu Val Gly Val Val Ser Trp 

195 200 205 

Gly Arg Asn Glu Cys Gly Ala He Asn Ser Pro Gly Val Tyr Thr Gin 

210 215 220 

Thr Asp Val Phe Phe Phe Leu Lys Trp He Lys Ser Thr He Ala Leu 
225 230 235 240 

Lys 



<210> 81 
<211> 231 
<212> PRT 

<213> Homo sapiens 

<400> 81 T7 
Arq He Val Ser Met Glu Ser Lys Lys Gly Lys Val Gin Trp Leu Val 

x y 5 10 15 

Val Leu Phe Gly Ser Ser Ser He Gin Gly Ser Arg Lys Asp Lys Ala 

20 25 30 

He Lys Thr Trp Thr Thr Phe Ser Tyr Thr Val Trp Leu Gly Ser He 

35 40 45 

Thr Val Gly Asp Ser Arg Lys Arg Val Lys Tyr Tyr Val Ser Lys He 

50 55 60 

Val He His Pro Lys Tyr Gin Asp Thr Thr Ala Asp Val Ala Leu Leu 
65 70 75 80 

Lys Leu Ser Ser Gin Val Thr Phe Thr Ser Ala He Leu Pro He Cys 

85 90 95 

Leu Pro Ser Val Thr Lys Gin Leu Ala He Pro Pro Phe Cys Trp Val 

100 105 * HO 

Thr Gly Trp Gly Lys Val Lys Glu Ser Ser Asp Arg Asp Tyr His Ser 

115 120 125 

Ala Leu Gin Glu Ala Glu Val Pro He He Asp Arg Gin Ala Cys Glu 
130 135 140 
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Gin 


Leu 


Tyr Asn 


Pro 


He 


Gly 


He 


145 






150 






lie 


Lys 


Glu Asp 


Lys 


He 


Cys 


Ala 








165 








Ser 


Cys 


Lys Gly 


Asp 


Ser 


Gly Gly 






180 










Val 


Trp 


He Gin 


Thr Gly Val 


Val 




195 








200 


Ser 


Leu 


Pro Gly 


Val 


Tyr 


Thr 


Asn 




210 








215 




Asn 


Ala 


Thr He 


Ser 


Arg 


Ala 




225 








230 







Phe Leu Pro Ala Leu Glu Pro Val 
155 160 
Gly Asp Thr Gin Asn Met Lys Asp 

170 175 
Pro Leu Ser Cys His He Asp Gly 
185 190 
Ser Trp Gly Leu Glu Cys Gly Lys 
205 

Val He Tyr Tyr Gin Lys Trp He 
220 



<210> 82 




























<211> 223 




























<212> PRT 




























<213> Homo sapiens 
























<400> 82 


























Leu 


Leu Ala Phe 


Asn 


Pro 


Asp 


Tyr 


Thr 


Val 


Ser 


Ser 


Thr 


Pro 


Pro 


Tyr 


1 




5 








10 










15 




Val Tyr Leu 


Lys 


Ser 


Asp 


Tyr 


Leu 


Pro 


Cys 


Ala Gly Val 


Leu 


He 


His 


20 










25 










30 






Pro Leu Trp 


Val 


He 


Thr 


Ala 


Ala 


His 


Cys 


Asn 


Leu 


Pro 


Lys 


Leu 


Arg 


35 










40 










45 




His 




Val He Leu Gly 


Val 


Thr 


He 


Pro 


Ala 


Asp 


Ser 


Asn 


Glu 


Lys 


Leu 


50 








55 










60 








Val 


Gin Val He 


Gly 


Tyr 


Glu 


Lys 


Met 


He 


His 


His 


Pro 


His 


Phe 


Ser 


65 


70 










75 








Glu 


80 


Thr Ser He 


Asp 


His 


Asp 


He 


Met 


Leu 


He 


Lys 


Leu 


Lys 


Thr 


Ala 




85 










90 










95 




Glu Leu Asn 


Asp 


Tyr 


Val 


Lys 


Leu 


Ala 


Asn 


Leu 


Pro 


Tyr 


Gin 


Thr 


He 




100 










105 










110 


Val 




Ser Glu Asn 


Thr 


Met 


Cys 


Ser 


Val 


Ser 


Thr 


Trp 


Ser 


Tyr 


Asn 


Tyr 


115 










12 0 










125 








Lys Glu Pro 


Asp 


Ser 


Leu 


Gin 


Thr 


Val 


Asn 


He 


Ser 


Val 


He 


Ser 


Lys 


130 








135 










140 










Pro Gin Cys 


Arg 


Asp 


Ala 


Tyr 


Lys 


Thr 


Tyr 


Asn 


He 


Thr 


Glu 


Asn 


Met 


145 






150 










155 










160 


Leu Cys Val 


Gly 


He 


Val 


Pro 


Gly Arg Arg 


Gin 


Pro 


Cys 


Lys 


Glu 


Val 


165 










170 










175 




Ser Ala Ala 


Pro 


Ala 


He 


Cys 


Asn 


Gly 


Met 


Leu Gin Gly 


lie 


Leu 


Ser 




180 










185 










190 


Ala 




Phe Ala Asp Gly 


Cys 


Val 


Leu 


Arg 


Ala 


Asp 


Val 


Gly 


He 


Tyr 


Lys 


195 










200 










205 








He Phe Tyr 


Tyr 


He 


Pro 


Trp 


He 


Glu 


Asn 


Val 


He 


Gin 


Asn 


Asn 




210 






215 










220 











<210> 83 
<211> 223 
<212> PRT 

<213> Homo sapiens 

<400> 83 , „ 

Arg Trp Ala Ala Gly Val Arg Val Pro Ala Gin His Ser Glu Glu Pro 

1 5 10 15 

Pro His Asn Arg Ser Thr Asn Pro Ser Asp Tyr Arg He Leu Leu Gly 

20 25 30 

Tyr Asp Gin Gin Ser His Pro Thr Glu His Ser Lys Gin Met Thr Val 
35 40 45 
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Asn 


Lys 


lie 


Met 


Val 


His 


Ala Asp 


Tyr Asn Glu Leu His Arg Met 


Gly 




50 










55 


60 




Ser 


Asp 


He 


Thr 


Leu 


Leu 


Gin Leu 


His His His Val Glu Phe Ser 


Ser 


65 








70 




75 


80 


His 


lie 


Leu 


Pro 


Ala 


Cys 


Leu Pro 


Glu Pro Thr Thr Trp Leu Ala 


Pro 










85 




90 95 




Asp 


Ser 


Ser 


Cys 


Trp 


He 


Ser Gly 


Trp Gly Met Val Thr Glu Asp 


Val 






100 








105 HO 




Phe 


Leu 


Pro 


Glu 


Pro 


Phe 


Gin Leu 


Gin Glu Ala Glu Val Gly Val 


Met 






115 








12 0 


125 




Asp 


Asn 


Thr 


Val 


Cys 


Gly Ser Phe 


Phe Gin Pro Gin Tyr Pro Gly Gin 


130 










135 


140 




Pro 


Ser 


Ser 


Ser 


Asp 


Tyr 


Thr He 


His Glu Asp Met Leu Cys Ala 


Gly 


145 








150 




155 


160 


Asp 


Leu 


He 


Thr 


Gly 


Lys 


Ala He 


Cys Arg Arg Asp Ser Arg Gly 


Pro 








165 






170. 175 




Leu 


Val 


Cys 


Pro 


Leu Asn Gly Thr 


Trp Phe Leu Met Gly Leu Ser 


Ser 






180 








185 190 




Trp 


Ser 


Leu 


Asp 


Cys 


Cys 


Ser Pro 


Val Gly Pro Arg Val Phe Thr 


Arg 




195 








200 


205 




Leu 


Pro 


Tyr 


Phe 


Thr 


Asn 


Trp He 


Ser Gin Lys Lys Arg Glu Ser 






210 








215 


220 





<210> 84 




















<211> 203 




















<212> PRT 




















<213> Homo sapiens 


















<400> 84 




















Arg Val Val 


Ser Gly Tyr Phe Ser 


Ala 


Asn 


Met 


Val 


Ser 


Thr 


Pro 


Trp 


1 


5 




10 










15 




Arg Thr Gly 


He Leu His Phe Asn 


His 


Cys 


He 


His 


Asp 


Leu 


Ser 


Gin 


20 


25 










30 






Thr Val Leu 


Gly Asp His Leu Val 


Lys 


Phe 


His 


His 


Thr 


He 


Lys 


He 


35 


40 










45 








He. Cys His 


He Leu Asp His Ala 


Val 


Ala 


Leu 


Leu 


Phe 


Leu 


Gin 


He 


50 


55 








60 










Ser Ser He 


Trp Asn Gly Asn He 


Tyr 


Pro 


He 


Pro 


Leu 


Pro 


Ala 


Phe 


65 


70 






75 










80 


Val Ser Tyr 


Lys Asn' Ala Ser He 


Cys 


Arg 


He 


Met 


Leu 


Trp 


Gly His 


85 




90 










95 




Ala Gly Asp 


Met Leu Phe Pro Met 


Asn 


Phe 


Pro 


Leu 


Cys 


Ala 


Arg 


Val 


100 


105 










110 






Asp Arg Gin 


Gin Gly Glu Gin Cys 


Glu 


His 


Thr 


Glu 


Phe 


Gly 


Tyr 


Gin 


115 


120 










125 








Pro Glu Thr 


He Lys Asn Asp Met 


Leu 


Cys 


Ala 


Gly 


Phe 


Glu 


Glu 


Gly 


13 0 


135 








14 0 










Lys Lys Asp 


Ala Cys Lys Gly Asp 


Ser Gly Gly 


Pro 


Leu 


Val 


Cys 


Leu 


145 


150 






155 










160 


Val Gly Gin 


Ser Trp Leu Gin Ala Gly Val 


He Ser Trp Gly Glu Gly 


165 




170 










175 




Cys Ala Arg 


Gin Asn Arg Pro Gly Val 


Tyr 


He 


Arg 


Val 


Thr 


Ala 


His 


180 


185 










190 






His Asn Trp 


He His Arg He He 


Pro 


Lys 


Leu 












195 


200 



















<210> 85 

<211> 235 

<212> PRT 

<213> Homo sapiens 
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<400> 85 
His He He Asn 
1 

Thr Arg Glu Gly 
20 

Thr Pro Pro Trp 
35 

Lys Lys Glu Thr 
50 

Asp Gly His Tyr 
65 

Lys Met Trp Arg 

He Lys His Arg 
100 

Pro Ser Thr Leu 
115 

His Tyr Val Ala 
130 

Gin Leu Pro Leu 
145 

Met Ser Tyr He 

Ala Lys Thr Val 
180 

Phe Asn Arg Ser 

195 . 
Cys Ser Asn Pro 
210 

Ser Lys Trp He 
225 



Gly Lys Arg Gin 
5 

Cys Pro Leu Leu 

Ala Thr Glu Gin 
40 

Glu Lys Thr He 
55 

Lys Glu Asn Lys 
70 

Asn Trp Asn Leu 
85 

He Thr Glu Pro 

Gly Gly Arg Gly 
120 

Gin Ala Gly Glu 
135 

He Leu Glu Pro 
150 

Met Pro Asp Met 
165 

Cys Glu Gly Asp 

Trp Leu Gin lie 
200 

Leu Tyr Pro Gly 
215 

Cys Asp Asn He 
230 



He Ala Phe Pro 
10 

Leu Phe Leu Ser 
25 

Asp Ser Asn Ser 

Pro Lys Ala Thr 
60 

Asn Arg Lys His 

75- 

Tyr Ala Leu Leu 
90 

Gly Arg Val Ala 
105 

Gly Trp He Thr 

Thr Ser Asp Glu 
140 

Trp Cys His Leu 
155 

Leu Cys Ala Gly 
170 

Ser Gly Gly Pro 
185 

Gly He Val Ser 

Val Tyr Ala Ser 
220 

Glu He Thr 
235 



Arg Arg Pro Gly 
15 

Asn Ala His Cys 
30 

Lys Lys Lys Lys 
45 

Val He Lys Thr 

Gin Val Leu Ala 
80 

Val Phe Cys Lys 
95 

His Ala Cys Asn 
110 

Arg Trp Gly Ser 
125 

Leu Gin Glu Met 

l,eu Tyr Gly His 
160 

Asp He Leu Asn 
175 

Leu Val Cys Glu 
190 

Trp Gly Arg Gly 
205 

Val Ser Tyr Phe 



<210> 86 
<211> 228 
<212> PRT 

<213> Homo sapiens 



<400> 86 



Arg 


Val 


Ser 


Gly 


Gly 


Arg 


Asp 


Ser 


Val 


Pro 


Ser 


Leu 


Val 


Pro 


Ser 


Thr 


1 








5 










10 










15 




Asn 


Ala 


Tyr 


Asn 


Arg 


Lys 


Arg 


Pro 


Glu 


Asn 


Pro 


His 


Met 


Cys 


Gly 


Gly 






20 










25 










30 






Phe 


Leu 


Ala 
35 


Ser 


Asn 


He 


Glu 


His 
40 


Leu 


Leu 


Cys 


Ala 


Arg 
45 


His 


Arg 


He 


Gin 


Lys 
50 


Ser 


Met 


Thr 


Ser 


Ala 
55 


His 


Arg 


Ser 


Lys 


Val 
60 


Arg 


Arg 


Leu 


Glu 


Ser 


His 


Trp 


Tyr 


Lys 


Gly 


Lys 


Arg 


Lys 


Thr 


Arg 


Ser 


Lys 


Glu 


Lys 


Arg 


65 










70 










75 










80 


Lys 


He 


Phe 


Gly 


Lys 


Tyr 


Thr 


Ser 


Asn 


He 


Asn 


Tyr 


Asp 


He 


Ser 


Leu 








85 










90 










95 




Leu 


Gly 


Leu 


Ala 


Ser 


Pro 


Ala 


Val 


He 


Thr 


Asp 


Lys 


Val 


He 


Pro 


Ala 






100 










105 










110 






Cys 


Leu 


Pro 


Ser 


Pro 


Asn 


Tyr 


Val 


Val 


Ala 


Asp 


Gin 


Thr 


Glu 


Cys 


Tyr 




115 










120 










125 








He 


Thr 


Asp 


Trp 


Gly 


Glu 


Thr 


Gin Gly 


Thr 


Phe 


Gly Ala Gly 


Phe 


Leu 




130 










135 










140 










Lys 


Glu 


Ala 


Gin 


Leu 


Pro 


Val 


He 


Glu 


Asn 


Glu 


Val 


Cys 


Asn 


Arg 


Tyr 


145 










150 










155 










160 


Glu 


Phe 


Leu 


Asn 


Gly Arg 


Val 


Lys 


Ser 


Thr 


Glu 


Leu 


Cys 


Ala Gly His 



165 170 175 
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Leu Ala Gly Gly lie Asp Ser Cys 
180 

Lys Val Ser Leu Phe Gly lie Gly 
195 , 200 

His Phe Tyr Thr Tyr lie His Thr 

210 215 
lie Lys Glu Asn 
225 



Lys Val Arg Lys Asp Gin Glu Thr 
185 190 
Cys Gly Asp Trp. Val Arg Ser Pro 
205 

Tyr Thr Pro Ser He Gin Glu Asn 
220 



<210> 87 

<211> 252 

<212> PRT 

<213> Homo sapiens 

<400> 87 

Arg Lys Leu Gly He Leu Asn His Gin Val Leu Phe Trp Tyr Asn Leu 

15 10 15 

Ser Leu Leu Leu His Phe He Gly Tyr Lys Ser Tyr Ser Glu Pro Leu 

20 25 30 

Ala Leu Phe Gly Glu Asp Asp Asp Met Asp Pro Arg Pro Ser Arg Ser 

35 40 45 

Tyr Gin Val Ala Asn Gly He Ala Val Leu Pro Val Ser Gly Thr Leu 

50 55 60 

Val Ser Lys Thr Arg Ala Leu Gin Pro Tyr Ser Gly Met Thr Gly Tyr 
65 70 75 80 

Asn Gly He He Ala Arg Leu Gin Gin Ala He Ser Asp Pro Gly Val 

85 90 95 

Asp Gly He Leu Leu Asp Met Asp Thr Pro Gly Gly Met Val Ser Gly 

100 105 HO 

Ala Phe Asp Cys Ala Asp He He Ala Arg Met Arg Asp He Lys Pro 

115 120 125 

He Trp Ala Leu Ala Asn Asp Met Asn Cys Ser Ala Gly Gin Leu He 

130 135 140 

Ala Ser Ser Ala Ser Arg Arg Leu Val Thr Gin Thr Ala Arg Thr Gly 
145 150 155 160 

Ser He Gly Val Met Met Ala His Ser Asn Tyr Gly Ala Ala Leu Lys 

165 170 175 

Thr Asn Gly Gly His Met His Thr Tyr Val Tyr Cys Ser Thr He His 

180 185 190 

Asn Ser Lys Asp Leu Lys Pro Thr Gin Met Pro He Asn Asn Arg Leu 

195 200 205 

Asp Lys Glu Asn Val Ala His He His His Gly He Leu Cys Ser His 

210 215 220 

Lys Lys Asp Glu Phe Met Ser Phe Ala Gly Thr Trp Met Lys Leu Glu 
225 " 230 235 240 

Thr He He Leu Ser Lys Leu Thr Gin Glu Gin Lys 
245 250 



<210> 88 
<211> 299 
<212> PRT 

<213> Homo sapiens 
<400> 88 

Met Leu Gly Val Leu Leu Gin He Trp Arg Gly Ser Trp Lys Lys Gin 

15 10 15 

Thr Gin Ala Gin Gly Arg Arg Glu Arg Ser Arg Gin Ala Ala Gly Ala 

20 25 30 

Val Ser Ala Gly Gly Arg Arg Ala Leu Leu Leu Tyr Leu Arg Ala Glu 
35 40 45 



WO 03/040393 



PCT/IB02/04615 



52/58 



Leu Glu Asp Lys Leu Ala Cys Val Asp Ser Arg Leu Arg Leu Val Met 

50 55 60 

Arg Gly Leu Val Leu Gly Arg Ala Ser Gly Ser Ser Val Arg Pro Lys 
65 70 75 80 

Leu Pro Lys Asp Val Arg Ala Asp Phe Gin Thr Arg lie Asp Ala Thr 

85 90 95 

Arg Gin Met Phe Ala Glu Lys Val Ser Ala Tyr Thr Gly Met Ser Val 

100 105 HO 

Gin Asp Val Leu Asp Thr Glu Ala Ala Val Phe Ser Gly Gin Glu Ser 

115 120 125 

Leu Asp Asn Gly Leu Ala Asp Glu Leu Val Asn Asn Thr Asp Ala Leu 

130 135 140 

Gly Val Met Arg Glu Ala Leu Asp Arg Arg Lys Lys Thr Thr Leu Gly 
145 150 155 160 

Gly Thr Met Pro Ser Pro Ser Ala Ser Ala Val Thr Thr Lys Pro Val 

165 170 175 

Asp Gin Ala Ala Thr Gin Thr Thr Ala Ser Ala Glu Gin Ala Thr Thr 

180 185 190 

Val Asp Thr Thr He Ala Ser Val Ala Ala Pro Val Asp Val Ser Ala 

195 200 205 

Gin Val Thr Ala Ala Val Ala Ala Glu Asn Ser Arg He Met Gly He 

210 215 220 

Leu Asn Cys Asp Glu Ala Lys Gly Arg Glu Ser Gin Ala Arg Ala Leu 
225 * " 230 235 240 

Ala Glu Thr Pro Gly Met Thr Val Glu Ser Ala Gin Arg He Leu Ala 

245 250 255 

Ala Ala Pro Gin Ser Ala Gin Met Arg Thr Asp Thr Ala Leu Asp Arg 

260 265 270 

Leu Met Glu Thr Ala Pro Gly Ala Leu Gin Ala Gly Ser Ala Ser Ser 

275 280 285 

Asp Ala Ala Asp Asp Leu Leu Asn Thr Pro Val 
290 295 



<210> 89 
<211> 463 
<212> PRT 

<213> Homo sapiens 
<400> 89 

Thr Asp Pro Trp Phe Ser Lys Gin Trp Tyr Met Asn Ser Glu Ala Gin 

15 10 15 

Pro Asp Leu Ser He Leu Gin Ala Trp Ser Gin Gly Leu Ser Gly Gin 

20 25 30 

Gly He Val Val Ser Val Leu Asp Asp Gly He Glu Lys Asp His Pro 

35 40 45 

Asp Leu Trp Ala Asn Tyr Asp Pro Leu Ala Ser Tyr Asp Phe Asn Asp 

50 55 60 

Tyr Asp Pro Asp Pro Gin Pro Arg Tyr Thr Pro Ser Lys Glu Asn Arg 
65 ' 70 75 80 

His Gly Thr Arg Cys Ala Gly Glu Val Ala Ala Met Ala Asn Asn Gly 

85 90 95 

Phe Cys Gly Val Gly Val Ala Phe Asn Ala Arg He Gly Gly Val Arg 

100 105 110 

Met Leu Asp Gly Thr He Thr Asp Val He Glu Ala Gin Ser Leu Ser 

115 120 125 

Leu Gin Pro Gin His He His He Tyr Ser Ala Ser Trp Gly Pro Glu 

130 135 140 

Asp Asp Gly Arg Thr Val Asp Gly Pro Gly He Leu Thr Arg Glu Ala 
145 ~ 150 155 160 

Phe Arg Arg Gly Val Thr Lys Gly Arg Gly Gly Leu Gly Thr Leu Phe 
165 17 0 175 
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He 


Trp 


Ala 


Ser 


Gly Asn Gly Gly Leu His Tyr Asp 


Asn 


Cys 


Asn 


Cys 








180 






185 






190 






Asp 


Gly Tyr Thr 


Asn 


Ser 


He His Thr Leu Ser 


Val 


Gly 


Ser 


Thr 


Thr 






195 








200 




205 








Gin 


Gin Gly Arg 


Val 


Pro 


Trp Tyr Ser Glu Ala 


Cys 


Ala 


Ser 


Thr 


Leu 




210 










215 


220 










Thr 


Thr 


Thr 


Tyr 


Ser 


Ser Gly Val Ala Thr Asp 


Pro 


Gin 


He 


Val 


Thr 


225 










230 


235 










240 


Thr 


Asp 


Leu 


His 


His 


Gly Cys Thr Asp Gin His 


Thr Gly 


Thr 


Ser 


Ala 










245 




250 








255 




Ser 


Ala 


Pro 


Leu 


Ala Ala Gly Met He Ala Leu 


Ala 


Leu 


Glu 


Ala 


Asn 








260 






265 






270 






Pro 


Phe 


Leu 


Thr 


Trp 


Arg 


Asp Met Gin His Leu 


Val 


Val 


Arg 


Ala 


Ser 






275 








280 




285 








Lys 


Pro 


Ala 


His 


Leu Gin Ala Glu Asp Trp Arg 


Thr 


Asn 


Gly Val 


Gly 




290 










295 


300 










Arg 


Gin 


Val 


Ser 


His 


His 


Tyr Gly Tyr Gly Leu 


Leu 


Asp 


Ala Gly Leu 


305 










310 


315 










320 


Leu 


Val 


Asp 


Thr 


Ala 


Arg 


Thr Trp Leu Pro Thr 


Gin 


Pro 


Gin 


Arg 


Lys 








325 




330 








335 




Cys 


Ala 


Val 


Arg 


Val 


Gin 


.Ser Arg Pro Thr Pro 


He 


Leu 


Pro 


Leu 


He 








340 






345 






350 






Tyr 


He 


Arg 


Glu 


Asn 


Val 


Ser Ala Cys Ala Gly 


Leu 


His 


Asn 


Ser 


He 




355 








360 




365 








Arg 


Ser 


Leu 


Glu 


His 


Val 


Gin Ala Gin Leu Thr 


Leu 


Ser 


Tyr 


Ser 


Arg 


370 










375 


380 










Arg 


Gly Asp 


Leu 


Glu 


He 


Ser Leu Thr Ser Pro 


Met 


Gly 


Thr 


Arg 


Ser 


385 










390 


395 










400 


Thr 


Leu 


Val 


Ala 


He 


Arg 


Pro Leu Asp Val Ser 


Thr 


Glu 


Gly Tyr Asn 










405 




410 








415 




Asn 


Trp 


Val 


Phe 


Met 


Ser 


Thr His Phe Trp Asp 


Glu 


Asn 


Pro 


Gin Gly 








420 






425 






430 






Val 


Trp 


Thr 


Leu 


Gly Leu 


Glu Asn Lys Gly Tyr Tyr 


Phe 


Asn Thr Gly 






435 








440 




445 








Glu 


Gly Gly Ala 


Gly Leu Trp Trp Ala Gly Leu Gly 


Ser 


Pro 


Thr 






450 










455 


460 











<210> 90 
<211> 225 
<212> PRT 

<213> Homo sapiens 



<400> 90 



Met 


Ala 


Ser 


Arg 


Tyr 


Asp 


Arg 


Ala He Thr 


Val 


Phe 


Ser 


Pro Asp Gly 


1 








5 






10 










15 


His 


Leu 


Phe 


Gin 


Val 


Glu 


Tyr 


Ala Gin Glu 


Ala 


Val 


Lys 


Lys 


Gly Ser 








20 








25 








30 




Thr 


Ala 


Val 


Gly 


He 


Arg 


Gly 


Thr Asn He 


Val 


Val 


Leu 


Gly Val Glu 






35 










40 






45 






Lys 


Lys 


Ser 


Val 


Ala 


Lys 


Leu 


Gin Asp Glu Arg 


Thr 


Val 


Arg 


Lys He 


50 










55 






60 








Cys 


Ala 


Leu 


Asp 


Asp 


His 


Val 


Cys Met Ala 


Phe Ala Gly 


Leu 


Thr Ala 


65 










70 






75 








80 


Asp 


Ala 


Arg 


Val 


Val 


He Asn Arg Ala Arg Val 


Glu 


Cys 


Gin 


Ser His 








85 






90 










95 


Lys 


Leu 


Thr 


Val 


Glu 


Asp 


Pro 


Val Thr Val 


Glu 


Tyr 


He 


Thr 


Arg Phe 






100 








105 








110 




He 


Ala 


Thr 


Leu 


Lys 


Gin 


He 


Asn Thr Lys 


Ser 


Tyr 


Leu 


Lys 


Phe Ser 






115 










120 






125 






Arg 


Glu 


Val 


Pro 


Phe 


Leu 


Phe 


Cys Phe Leu 


Phe 


Phe 


Ser 


Trp 


Asp Tyr 


130 










135 






140 
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Arg 


His 


Met 


Pro 


Pro 


His 


Leu 


Ala 


Asn 


Phe 


Phe 


Ala 


Gly 


Tyr 


Lys 


He 


145 










150 










155 










160 


Asn 


Lys 


Gin Lys 


Phe 


Ala 


Ala 


Phe 


Leu 


Tyr 


Ala 


Asn 


Asn 


Glu 


Gin 


Ser 








165 










170 










175 




Glu 


Lys 


Glu 


He 


Lys 


Lys 


Val 


He 


Pro 


Phe 


Met 


He 


Ala 


Thr 


Asn 


Lys 






180 










185 










190 






He 


Lys 


Cys 


He 


Glu 


He 


Asn 


Leu 


Thr 


Lys 


Glu 


Val 


Lys 


Asp 


Phe 


His 




195 










200 










205 








Asn 


Glu 


Asn 


Tyr 


Lys 


Thr 


Leu 


Met 


Gin 


Glu 


Thr 


Glu 


Ala 


Asp 


Thr 


Lys 




210 








215 










220 











Lys 
225 



<210> 91 
<211> 228 
<212> PRT 

<213> Homo sapiens 



<400> 91 



Ser 


Lys 


Gly Gly 


He 


Ser 


Val 


Gly Leu 


Cys 


Val 


Arg Asp 


Gly Val 


Val 


1 






5 










10 








15 




Val 


Val 


Ser Arg 


Asp 


Thr 


Asn 


Ser 


Pro 


His 


Arg 


Val Thr 


Pro 


Leu 


Leu 








20 










25 








30 






Asn 


Glu 


Leu 


Met 


Cys 


Leu 


Arg 


Cys 


Ser 


Gly 


Leu 


Ala Ala 


Ala 


Ala 


Lys 






35 










40 








45 








Met 


Val 


Ala 


Ala 


Phe 


He 


Ser 


Leu Arg Arg 


Ser 


Ala Glu 


He 


Asn 


Lys 




50 










55 










60 








Tyr 


Val 


He 


Tyr 


Pro 


Arg Asp 


Val 


Cys 


Thr 


Pro 


Tyr He 


Val 


Asn 


Arg 


65 








70 










75 








80 


Met 


Ser 


Leu 


He 


Lys 


He 


Lys 


Tyr 


Thr 


Gin 


Ser Asn Gly 


Arg 


Arg 


Pro 










85 










90 








95 




Phe 


Gly 


He 


Ser 


Ala 


Leu 


He 


Val 


Gly 


Phe 


Asp 


Asp Asp 


Gly 


He 


Ser 






100 










105 








110 






Arg 


Leu 


Tyr 


Gin 


Thr Asp 


Pro 


Ser Gly Thr 


Tyr 


His Ala 


Trp 


Lys 


Ala 




115 










120 








125 








Asn 


Ala 


He 


Gly 


Arg 


Ser 


Ala 


Lys 


Thr 


Val 


Arg 


Glu Phe 


Leu 


Glu 


Lys 




130 










135 










140 








Asn 


Tyr 


Thr 


Glu 


Asp 


Ala 


He 


Ala 


Ser 


Asp 


Ser 


Glu Ala 


He 


Lys 


Leu 


145 








150 










155 








160 


Ala 


He 


Lys 


Ala 


Leu 


Leu 


Glu 


Val 


Val 


Gin 


Ser 


Gly Gly 


Lys 


Asn 


He 








165 










170 








175 




Glu 


Leu 


Ala 


He 


He 


Arg Arg 


Asn 


Gin 


Pro 


Leu 


Lys Lys 


Lys 


Glu 


Glu 








180 










185 








190 






Glu 


Glu 


Glu 


Arg 


Arg 


Lys 


Lys 


Lys 


Glu 


Glu 


Glu 


Gly Gly 


Glu 


Glu 


Glu 






195 










200 








205 








Glu 


Glu 


Glu 


Glu 


Glu Glu Asp 


Glu 


Glu 


Glu 


Glu 


Glu Glu 


Val 


Glu 


Glu 




210 










215 










220 








Glu 


Glu 


Glu 


Glu 
























225 































<210> 92 
<211> 1005 
<212> PRT 

<213> Homo sapiens 
<400> 92 

Glu Asn Gly Ser Leu Thr Trp Gin Glu Leu Leu Arg Gin Thr Gly Lys 

1 5 10 15 

Cys Ser He Pro Cys Leu He Asp Thr Gly Ala Gin Ala Asn He He 
20 25 30 
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Thr Glu Glu Thr Val Arg Ala His Lys Leu Pro Thr Arg Pro Trp Ser 

35 40 45 

Lys Ser Val lie Tyr Gly Gly Val Tyr Pro Asn Lys lie Asn Arg Lys 

50 55 60 

Thr lie Lys Leu Asn He Ser Leu Asn Gly He Ser lie Lys Thr Glu 
65 70 75 80 

Phe Leu Val Val Lys Lys Phe Ser His Pro Ala Ala lie Ser Phe Thr 

85 90 95 

Thr Leu Tyr Asp Asn Asn He Glu lie Ser Ser Ser Lys His Thr Leu 

100 105 110 

Ser Gin Met Asn Lys Val Ser Asn He Val Lys Glu Pro Glu Leu Pro 

115 120 125 

Asp He Tyr Lys Glu Phe Lys Asp He Thr Ala Glu Thr Asn Thr Glu 

130 ~ 135 140 

Lys Leu Pro Lys Pro He Lys Gly Leu Glu Phe Glu Val Glu Leu Thr 
145 ~ 150 155 160 

Gin Glu Asn Tyr Arg Leu Pro He Arg Asn Tyr Pro Leu Pro Pro Gly 

165 170 175 

Lys Met Gin Ala Met Asn Asp Glu He Asn Gin Gly Leu Lys Ser Gly 

180 185 190 

He He Arg Glu Ser Lys Ala He Asn Ala Cys Pro Val Met Phe Val 

195 200 2 05 

Pro Lys Lys Glu Gly Thr Leu Arg Met Val Val Asp Tyr Lys Pro Leu 

210 " 215 220 

Asn Lys Tyr Val Lys Pro Asn He Tyr Pro Leu Pro Leu He Glu Gin 
225 * 230 235 240 

Leu Leu Ala Lys He Gin Gly Ser Thr He Phe Thr Lys Leu Asp Leu 

245 250 255 

Lys Ser Ala Tyr His Leu He Arg Val Arg Lys Gly Asp Glu His Lys 

260 265 270 

Leu Ala Phe Arg Cys Pro Arg Gly Val Phe Glu Tyr Leu Val Met Pro 

275 " 280 285 

Tyr Gly He Ser He Ala Pro Ala His Phe Gin Tyr Phe He Asn Thr 

290 295 300 

He Leu Gly Glu Val Lys Glu Ser His Val Val Cys Tyr Met Asp Asn 
305 310 315 320 

He Leu He His Ser Lys Ser Glu Ser Glu His Val Lys His Val Lys 

325 330 335 

Asp Val Leu Gin Lys Leu Lys Asn Ala Asn Leu He He Asn Gin Ala 

340 345 350 

Lys Cys Glu Phe His Gin Ser Gin Val Lys Phe He Gly Tyr His He 

355 360 365 

Ser Glu Lys Gly Phe Thr Pro Cys Gin Glu Asn He Asp Lys Val Leu 

370 375 380 

Gin Trp Lys Gin Pro Lys Asn Arg Lys Glu Leu Arg Gin Phe Leu Gly 
385 390 395 400 

Ser Val Asn Tyr Leu Arg Lys Phe He Pro Lys Thr Ser Gin Leu Thr 

405 410 415 

His Pro Leu Asn Asn Leu Leu Lys Lys Asp Val Arg Trp Lys Trp Thr 

420 425 430 

Pro Thr Gin Thr Gin Ala He Glu Asn He Lys Gin Cys Leu Val Ser 

435 440 445 

Pro Pro Val Leu Arg His Phe Asp Phe Ser Lys Lys He Leu Leu Glu 

450 455 460 

Thr Asp Ala Ser Asp Val Ala Val Gly Ala Val Leu Ser Gin Lys His 
465 470 475 480 

Asp Asp Asp Lys Tyr Tyr Pro Val Gly Tyr Tyr Ser Ala Lys Met Ser 

485 490 495 

Lys Ala Gin Leu Asn Tyr Ser Val Ser Asp Lys Glu Met Leu Ala He 

500 505 510 

He Lys Ser Leu Lys His Trp Arg His Tyr Leu Glu Ser Thr He Glu 
515 520 525 
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Pro 


Phe 


Lys 


He 


Leu 


Thr Asp His Arg 


Asn 


Leu 


He 


Gly Arg 


He 


Thr 




53 0 










535 










540 










Asn 


Glu 


Ser 


Glu 


Pro 


Glu 


Asn 


Lys 


Arg 


Leu 


Ala 


Arg 


Trp 


Gin 


Leu 


Phe 


C A C 










550 










555 










560 


Leu 


Gin 


Asp 


Phe 


Asn 


Phe 


Glu 


He 


Asn 


Tyr 


Arg 


Pro Gly Ser Ala Asn 










565 










570 










575 




His 


lie 


Ala 


Asp 


Ala 


Leu 


Ser 


Arg 


He 


Val 


Asp 


Glu 


Thr 


Glu 


Pro 


He 








580 










585 










590 






Pro 


Lys 


Asp 


Ser 


Glu 


Asp 


Asn 


Ser 


He 


Asn 


Phe 


Val 


Asn 


Gin 


He 


Ser 






595 










600 










605 








lie 


Thr 


Asp 


Asp 


Phe 


Lys 


Asn 


Gin 


Val 


Val 


Thr 


Glu 


Tyr 


Thr 


Asn Asp 




610 










615 










620 










Thr 


Lys 


Leu 


Leu 


Asn 


Leu 


Leu 


Asn 


Asn 


Glu 


Asp 


Lys 


Arg 


Val 


Glu 


Glu 


625 










630 










635 










640 


Asn 


He 


Gin 


Leu 


Lys 


Asp Gly Leu Leu 


He 


Asn 


Ser 


Lys Asp Gin 


He 










645 










650 










655 




Leu 


Leu 


Pro 


Asn 


Asp 


Thr 


Gin 


Leu 


Thr 


Arg 


Thr 


He 


He 


Lys 


Lys 


Tyr 








660 










665 










670 






His 


Glu 


Glu 


Gly Lys 


Leu 


He 


His 


Pro 


Gly 


He 


Glu 


Leu 


Leu 


Thr 


Asn 






675 










680 










685 








He 


He 


Leu 


Arg 


Arg 


Phe 


Thr 


Trp' 


Lys 


Gly 


He 


Arg 


Lys 


Gin 


He 


Gin 




690 










695 










700 










Glu 


Tyr 


Val 


Gin 


Asn 


Cys 


His 


Thr 


Cys 


Gin 


He 


Asn 


Lys 


Ser 


Arg 


Asn 


705 










710 










715 










720 


His 


Lys 


Pro 


Tyr 


Gly 


Pro 


Leu 


Gin 


Pro 


He 


Pro 


Pro 


Ser Glu Arg 


Pro 










725 










730 










735 




Trp 


Glu 


Ser 


Leu 


Ser 


Met 


Asp 


Phe 


He 


Thr 


Ala 


Leu 


Pro 


Glu 


Ser 


Ser 








740 










745 










750 






Gly 


Tyr 


Asn 


Ala 


Leu 


Phe 


Val 


Val 


Val 


Asp 


Arg 


Phe 


Ser 


Lys 


Met 


Ala 






755 










760 










765 








lie 


Leu 


Val 


Pro 


Cys 


Thr 


Lys 


Ser 


He 


Thr 


Ala 


Glu 


Gin 


Thr 


Ala 


Arg 




770 










775 










780 










Met 


Phe 


Asp 


Gin 


Arg 


Val 


He 


Ala 


Tyr 


Phe 


Gly 


Asn 


Pro 


Lys 


Glu 


He 


785 










790 










795 










800 


He 


Ala 


Asp 


Asn 


Asp 


His 


He 


Phe 


Thr 


Ser 


Gin 


Thr 


Trp 


Lys 


Asp 


Phe 










805 










810 










815 




Ala 


His 


Lys 


Tyr 


Asn 


Phe 


Val 


Met 


Lys 


Phe 


Ser 


Leu 


Pro 


Tyr Arg 


Pro 








820 










825 










830 






Gin 


Thr 


Asp 


Gly Gin 


Thr 


Glu 


Arg 


Thr 


Asn 


Gin 


Thr 


Val 


Glu 


Lys 


Leu 






8 35 










840 










845 








Leu 


Arg 


Cys 


Val 


Cys 


Ser 


Thr 


His 


Pro 


Asn 


Thr 


Trp 


Val 


Asp 


His 


He 




850 










855 










860 










Ser 


Leu 


Val 


Gin 


Gin 


Ser Tyr Asn Asn 


Ala 


He 


His 


Ser 


Ala 


Thr 


Gin 


865 










870 










875 










880 


Met 


Thr 


Pro 


Phe 


Glu 


He 


Val 


His 


Arg 


Tyr 


Ser 


Pro 


Ala 


Leu 


Ser 


Pro 










885 










890 










895 




Leu 


Glu 


Leu 


Pro 


Ser 


Phe 


Ser 


Asp 


Lys 


Thr 


Asp 


Glu 


Asn 


Ser 


Gin 


Glu 








900 










.905 










910 






Thr 


He 


Gin 


Val 


Phe 


Gin 


Thr 


Val 


Lys 


Glu 


His 


Leu 


Asn 


Thr 


Asn 


Asn 






915 










920 










925 








He 


Lys 


Met 


Lys 


Lys 


Tyr Phe Asp 


Met 


Lys 


He 


Gin 


Glu 


He 


Glu 


Glu 




930 










935 










940 










Phe 


Gin 


Pro 


Gly Asp 


Leu 


Val 


Met 


Val 


Lys 


Arg 


Thr 


Lys 


Thr 


Ala 


Phe 


945 










950 










955 










960 


Leu 


Tyr 


Thr 


Asn 


Asn 


Arg 


Gin 


Thr 


Glu 


Ser 


Gin 


He 


Met 


Ser 


Glu 


Leu 










965 










970 










975 




Pro 


Phe 


Thr 


He 


Ala 


Ser Lys 


Arg 


He 


Lys 


Tyr 


Leu Gly He 


Gin 


Leu 








980 










985 










990 






Thr 


Arg 


Glu 


Val 


Lys 


Asp 


Leu 


Phe 


Lys 


Glu 


Asn 


Tyr 


Lys 









995 1000 1005 
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<210> 93 

<211> 195 

<212> PRT 

<213> Homo sapiens 



<400> 93 



Gly 


Pro 


Arg 


Leu 


Ala 


His Gly Thr 


Thr 


Thr 


Leu 


Ala 


Phe Arg Phe Arg 


1 








5 








10 










15 


His 


Gly 


Val 


lie 


Ala 


Ala Ala 


Asp 


Thr Arg 


Ser 


Ser 'Cys 


Gly Ser Tyr 








20 








25 










30 




Val 


Ala 


Cys 


Pro 


Ala 


Ser Cys 


Lys 


Val 


He 


Pro 


Val His 


Gin 


His Leu 






35 








40 










45 






Leu 


Gly 


Thr 


Thr 


Ser Gly Thr 


Ser 


Ala 


Asp 


Cys 


Ala 


Thr 


Trp 


Tyr Arg 




50 








55 










60 








Val 


Leu 


Gin 


Arg 


Glu 


Leu Arg 


Leu 


Arg 


Glu 


Leu 


Arg 


Glu 


Gly Gin Leu 


65 










70 








75 








80 


Pro 


Ser 


Val 


Ala 


Ser 
85 


Ala Ala 


Lys 


Leu 


Leu 
90 


Ser 


Ala 


Met 


Met 


Ser Gin 
95 


Tyr 


Arg 


Gly 


Leu 
100 


Asp 


Leu Cys 


Val 


Ala 
105 


Thr 


Ala 


Leu 


Cys 


Gly 
110 


Trp Asp 


Arg 


Ser 


Gly 
115 


Pro 


Glu 


Leu Phe 


Tyr 
120 


Val 


Tyr 


Ser 


Asp 


Gly 
125 


Thr 


Arg Leu 


Gin 


Gly 


Asp 


He 


Phe 


Ser Val 


Gly 


Ser Gly Ser 


Pro 


Tyr 


Ala 


Tyr Gly 




130 








135 










14 0 








Val 


Leu 


Asp 


Arg 


Gly Tyr Arg Tyr 


Asp 


Met 


Ser 


Thr 


Gin 


Glu 


Ala Tyr 


145 










150 








155 








160 


Ala 


Leu 


Ala 


Arg 


Cys 
165 


Ala Val 


Ala 


His 


Ala 
170 


Thr 


His 


Arg 


Asp 


Ala Tyr 
175 


Ser 


Gly 


Gly 


Ser 


Val 


Asp Leu 


Phe 


His 


Val 


Arg 


Glu 


Ser 


Gly Trp Glu 








180 








185 










190 




His 


Val 


Ser 
195 

























<210> 94 
<211> 198 
<212> PRT 

<213> Homo sapiens 



<400> 94 



Ser 


He 


Met 


Ser 


Tyr 


Asn 


Gly Gly 


Ala 


He 


Met 


Ala 


Met Lys 


Gly 


Lys 


1 








5 










10 








15 




Asn 


Arg 


Val 


Ala 


He Ala Ala Asp 


Arg 


His 


Phe 


Gly 


He Gin 


Ala 


Gin 








20 










25 








30 






Met 


Val 


Thr 


Thr 


Asp 


Phe 


Gin 


Glu 


He 


Phe 


Pro 


Met 


Gly Gly 


Trp 


Leu 






35 










40 










45 






Tyr 


He 


Gly 


Leu 


Ala 


Gly Leu Ala 


Thr Asp Val 


Gin Arg Val 


Ala 


Gin 




50 










55 










60 








Cys 


Leu 


Lys 


Phe 


Gin 


Leu 


Asn 


Leu 


Tyr 


Glu 


Leu 


Lys 


Glu Gly Gin Gin 


65 










70 










75 








80 


He 


Lys 


Pro 


Tyr 


Thr 


Phe 


Thr 


Ser 


Met 


Val 


Ala 


Asn 


Phe Leu 


Tyr 


Glu 










85 










90 








95 




Lys 


His 


Phe 


Gly 


Pro 


Tyr 


Tyr 


Thr 


Asp 


Pro 


Val 


He 


Ala Gly 


Leu 


Asp 








100 










105 








no 






Leu 


Lys 


Thr 


Phe 


Lys 


Pro 


Phe 


Ser 


Cys 


Ser 


Leu 


Asp 


Leu He 


Gly 


Phe 






115 










120 










125 






Pro 


Met 


Val 


Thr 


Asp Asp 


Phe 


Val 


Val 


Asn Gly 


Ser 


Tyr Ala 


Glu 


Gin 




130 










135 










140 








Met 


Tyr 


Gly 


Met 


Cys 


Glu 


Ser 


Leu 


Trp 


Glu 


Pro 


Asn 


Met Asp 


Pro 


Glu 


145 










150 










155 








160 


His 


Pro 


Phe 


Glu 


Thr 


He 


Ser 


Pro 


Ala 


Met 


Leu 


Asn 


Ala Val 


Asp 


Trp 



165 170 175 
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Gly Ala Gly Ser Gly Met Gly Val lie lie His lie Thr Lys Lys Asp 

180 185 190 

Lys lie Thr Thr Arg Thr 
195 



